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The present invention describes the identification, isolation, sequencing and characterization of two human presenilin genes, PS-t and 
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GENETIC SEQUENCES AND PROTEINS 
REUITED TO ALZHEIMER'S DISEASE, 
AND USES THEREFOR 



Cross Reference To Related Applications 

This application is a Continuation- In- Part of U.S. 
application Serial No. 08/509,359, filed on July 31, 1995, which 
is a Continuation- In- Part of U.S. application Serial No. 
5 08/496,841, filed on June 28, 1995, which is a Continuation-in- 
Part of U.S. Application Serial No. 08/431,048, filed on April 
28, 1995, all of which were entitled GENETIC SEQUENCES AND 
PROTEINS RELATED TO ALZHEIMER'S DISEASE (Inventors: Peter H. St. 
George-Hyslop, Johanna M. Rotnmens and Paul £. Fraser) , and all of 
10 which are incorporated herein by reference. 

Field of the Invention 

The present invention relates generally to the field of 
neurological and physiological dysfunctions associated with 
Alzheimer's Disease. More particularly, the invention is 

15 concerned with the identification, isolation and cloning of genes 
which are associated with Alzheimer's Disease, as well as their 
treuxscripts, gene products, associated sequence information, and 
related genes. The present invention also relates to methods for 
detecting and diagnosing carriers of normal and mutant alleles of 

20 these genes, to methods for detecting and diagnosing Alzheimer's 
Disease, to methods of identifying genes and proteins related to 
or interacting with the Alzheimer's genes and proteins, to 
methods of screening for potential therapeutics for Alzheimer's 
Disease, to methods of treatment for Alzheimer's Disease, and to 

25 cell lines and animal models useful in screening for and 

evaluating potentially useful therapies for Alzheimer's Disease. 

Background of the Invention 

In order to facilitate reference to various journal 
articles, a listing of the articles is provided at the end of 
30 this specification. 

Alzheimer's Disease (AD) is a degenerative disorder of the 
htunan central nervous system characterized by progressive memory 
impairment and cognitive and intellectual decline during mid to 
late adult life (Katzman, 1986) . The disease is accompanied by 
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a constellation of neuropathologic features principal amongVt 
which are the presence of extracellular amyloid or senile plaques 
and the neurofibrillary degeneration of neurons. The etiology of 
this disease is complex, although in some families it appears to 
5 be inherited as an autosomal dominant trait. However, even 
amongst these inherited forms of AD, there are at least three 
different genes which confer inherited susceptibility to this 
disease (St. George-Hyslop et al., 1990). The €4 (C112R) allelic 
polymorphism of the Apolipoprotein E (ApoE) gene has been 

10 associated with AD in a significant proportion of cases with 

onset late in life (Saunders et al., 1993; Strittmatter et al., 
1993) . Similarly, a very small proportion of familial cases with 
onset before age 65 years have been associated with mutations in 
the ^-amyloid precursor protein (APP) gene ( Chart ier-Harlin et 

15 al,, 1991; Goate et al., 1991; Murrell et al., I99l; Karlinsky et 
al., 1992; Mullan et al., 1992). A third locus (AD3) associated 
with a larger proportion of cases with early onset AD has 
recently been mapped to chromosome I4q24.3 (Schellenberg et al., 
1992; St. George-Hyslop et al., 1992; Van Broeckhoven et al., 

20 X992). 

Although the chromosome I4q region carries several genes 
which could be regarded as candidate genes for the site of 
mutations associated with AD3 (e.g., cFOS, alpha-l- 
antichymotrypsin, and cathepsin G) , most of these candidate genes 
25 have been excluded on the basis of their physical location 

outside the AD3 region and/or the absence of mutations in their 
respective open reading frames (Schellenberg et al., 1992; Van 
BroecJchoven et al., 1992; Rogaev et al., 1993; Wong et al., 
1993). 

30 There have been several developments and commercial 

directions or strategies in respect of treatment of Alzheimer's 
Disease and diagnosis thereof. Published PCX application WO 94 
23049 describes transf ection of high molecular weight YAC DNA 
into specific mouse cells. This method may be used to analyze 

35 large gene complexes. For example, the transgenic mice may have 
increased APP gene dosage, which mimics the trisomic condition 
that prevails in Down's Syndrome, and allows the generation of 
animal models with /J-amyloidosis similar to that prevalent in 
individuals with Alzheimer's Disease. Published international 

40 application WO 94 00569 describes transgenic non-human animals 
harboizring large transgenes such as the transgene comprising a 
hximan APP gene. Such animal models can provide usefxxl models of 
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hiiman genetic diseases such as Alzheimer's Disease. 

Canadian Patent application No. 2096911 describes a nucleic 
acid coding for an APP- cleaving protease, which is associated 
with Alzheimer's Disease and Down's syndrome. The genetic 
5 information, which was isolated from chromosome 19, may be used 
to diagnose Alzheimer's Disease. Canadian Patent application 
2071105, describes detection and treatment of inherited or 
acquired Alzheimer's Disease by the use of YAC nucleotide 
sequences. The YACs are identified by the numbers 23CB10, 28CA12 

10 and 26FF3. 

U.S. Patent 5,297,562, describes detection of Alzheimer's 
Disease associated with trisomy of chromosome 21. Treatment 
involves methods for reducing the proliferation of chromosome 21 
trisomy. Canadian Patent application No, 2054302 describes 

15 monoclonal antibodies which recognize a human brain cell nucleus 
protein encoded by chromosome 21 and are used to detect changes 
of expression due to Alzheimer's Disease or Down's Syndrome. 
The monoclonal antibody is specific to a protein encoded by human 
chromosome 21 and is foxrnd in large pyramidal cells of human 

20 brain tissue. 

Summary of the Invention 

The present invention. is based, in part, upon the 
identification, isolation, cloning and sequencing of two 
mammalian genes which have been designated presenilin-1 (PSl) and 

25 presenilin-2 (PS2) . These two genes, and their corresponding 
protein products, are members of a highly conserved family of 
genes, the presenilins, with homologues or orthologues in other 
mammalian species (e.g., mice; rats) as well as orthologues in 
invertebrate species (e.g., C. eleoans. D. melanoaaster ^ . 

30 Mutations in these genes have been linked to the development in 
humans of forms of Familial Alzheimer's Disease and may be 
causative of other disorders as well (e.g., other cognitive, 
intellectual, neurological or psychological disorders such as 
cerebral hemorrhage, schizophrenia, depression, mental 

35 retardation and epilepsy) . The present disclosure provides 

genomic and cDNA nucleotide sequences for human PSl (hPSl) and 
human PS2 (hPS2) genes, a murine PSl homologue (mPSl) , and 
related genes from C. eleaans (sel-12, SPE-4) and D. melanoaaater 
(DmPS) . The disclosure also provides the predicted amino acid 

40 sequences of the presenilin proteins encoded by these genes and a 
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15 



20 



structural characterization of the presenilins. including 
putative functional domains and antigenic determinants. A number 
of mutations in the presenilins which are causative of 
Alzheimer's Disease (AD) in humans are also disclosed and related 
S to the functional domains of the proteins. 

Thus, -in one series of embodiments, the present invention 
provides isolated nucleic acids including nucleotide sequences 
comprising or derived from the presenilin genes and/or encoding 
polypeptides comprising or derived from the presenilin proteins. 
10 The presenilin sequences of the invention include the 

specifically disclosed sequences, splice variants of these 
sequences, allelic variants of these sequences, synonymous 
sequences, and homologous or orthologous variants of these 
sequences. Thus, for example, the invention provides genomic and 
CDNA sequences from the hPSl gene, the hPS2 gene, the mPSl gene, 
and the DmPS gene. The present invention also provides allelic' 
variants and homologous or orthologous sequences by providing 
methods by which such variants may be routinely obtained. The 
present invention also specifically provides for mutant or 
disease-causing variants of the presenilins by disclosing a 
number of specific mutant sequences and by providing methods by 
which other such variants may be routinely obtained. Because the 
nucleic acids of the invention may be used in a variety of 
diagnostic, therapeutic and recombinant applications, various 
25 subsets of the presenilin sequences and combinations of the 
presenilin sequences with heterologous sequences are also 
provided. For example, for use in allele specific hybridization 
screening or PCR amplification techniques, subsets of the 
presenilin sequences, including both sense and antisense 
30 sequences, and both normal and mutant sequences, as well as 

intronic, exonic and untranslated sequences, are provided. Such 
sequences may comprise a small number of consecutive nucleotides 
from the sequences which are disclosed or otherwise enabled 
herein but preferably include at least 8-10, and more preferably 
35 9-25, consecutive nucleotides from a presenilin sequence. Other 
preferred subsets of the presenilin sequences include those 
encoding one or more of the functional domains or antigenic 
determinants of the presenilin proteins and, in particular, may 
include either normal <wild-type) or mutant sequences. The 
invention also provides for various nucleic acid constructs in 



40 
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which presenilin sequences, either complete or subsets, are 
operably joined to exogenous sequences to form cloning vectors, 
expression vectors, fusion vectors, transgenic constructs, and 
the like. Thus, in accordance with another aspect of the 
5 invention, a recombinant vector for transforming a mammalian or 
invertebrate tissue cell to express a normal, or mutant presenilin 
sequence in the cells is provided. 

In another series of embodiments, the present invention 
provides for host cells which have been transfected or otherwise 
10 transformed with one of the nucleic acids of the invention. The 
cells may be trsuisf ormed merely for purposes of propagating the 
nucleic acid constructs of the invention, or may be transformed 
so as to express the presenilin sequences. The transformed cells 
of the invention may be used in assays to identify proteins 
15 and/or other compounds which affect normal or mutant presenilin 
eacpression, which interact with the normal or mutant presenilin 
proteins, and/or which modulate the function or effects of the 
normal or mutant proteins, or to produce the presenilin proteins, 
fusion proteins, f\inctional domains, antigenic determinants, 
20 and/or antibodies of the invention. Transformed cells may also 
be implanted into hosts, including humans, for therapeutic or 
other reasons. Preferred host cells include mammalian cells from 
neuronal, fibroblast, bone marrow, spleen, organotypic or mixed 
cell cultures, as well as bacterial, yeast, nematode, insect and 
25 other invertebrate cells. For uses as described belowj preferred 
cells also include embryonic stem cells, zygotes, gametes, and 
germ line cells. 

In another series of embodiments, the present invention 
provides transgenic animal models for AD and other diseases or 
30 disorders associated with mutations in the presenilin genes. The 
animal may be essentially any mammal, including rats, mice, 
hamsters, guinea pigs, rabbits, dogs, cats, goats, sheep, pigs, 
and non-human primates. In addition, invertebrate models, 
including nematodes cuid insects, may be used for certain 
35 applications. The animal models are produced by standard 

transgenic methods including microinjection, tramsf ection, or 
other forms of transformation of embryonic stem cells, zygotes, 
gametes, and germ line cells with vectors including genomic or 
cDNA fragments, minigenes, homologous recombination vectors, 
40 viral insertion vectors and the like. Suitable vectors include 
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vaccinia virus, adenovirus, adeno associated virus, retrovirus, 
liposome transport, neuraltropic viruses, and Herpes simplex 
virus. The cuiimal models may include treuisgenic sequences 
comprising or derived from the presenilins, including normal and 
5 mutant sequences, intronic, exonic and \mtranslated sequences, 
and sequences encoding subsets of the presenilins such as 
functional domains. The major types of animal models provided 
include: (1) Animals in which a normal human presenilin gene 
has been recombinantly introduced into the genome of the animal 

10 as an additional gene, under the regulation of either an 

exogenous or an endogenous promoter element, and as either a 
minigene or a large genomic fragment; in which a normal hximan 
presenilin gene has been recombinantly substituted for one or 
both copies of the animal's homologous presenilin gene by 

15 homologous recombination or gene targeting? auid/or in which one 
or both copies of one of the animal's homologous presenilin genes 
have been recombinaoitly '*hiunanized" by the partial substitution 
of sequences encoding the human homologue by homologous 
recombination or gene targeting . (2) Animals in which a mutant 

20 human presenilin gene has been recombinantly introduced into the 
genome of the animal as an additional gene, under the regulation 
of either an exogenous or an endogenous promoter element, and as 
either a minigene or a large genomic fragment; in which a mutant 
human presenilin gene has been recombinantly substituted for one 

25 or both copies of the animal's homologous presenilin gene by 

homologous recombination or gene targeting; and/or in which one 
or both copies of one of the animal's homologous presenilin genes 
have been recombinantly "humanized" by the partial substitution 
of sequences encoding a mutant human homologue by homologous 

30 recombination or gene targeting. (3) Animals in which a mutant 
version of one of that animal's presenilin genes has been 
recombinantly introduced into the genome of the ainimal as an 
additional gene, under the regulation of either an exogenous or 
an endogenous promoter element, and as either a minigene or a 

35 large genomic fragment; and/or in which a mutant version of one 
of that animal's presenilin genes has been recombinantly 
sxibstituted for one or both copies of the cuiimal's homologous 
presenilin gene by homologous recombination or gene targeting. 
(4) ■'Knock-out" animals in which one or both copies of one of 

40 the animal's presenilin genes have been partially or completely 
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deleted by homologous recombination or gene targeting, or have 
been inactivated by the insertion or substitution by homologous 
recombination or gene targeting of exogenous sequences, in 
preferred embodiments, a transgenic mouse model for AD has a 
5 transgene encoding a normal hximan PSl or PS2 protein, a mutant 
human or murine PSl or PS2 protein, or a humanized normal or 
mutant murine PSl or PS2 protein. 

In another series of embodiments, the present invention 
provides for substantially pure protein preparations including 

10 polypeptides comprising or derived from the presenilins proteins. 
The presenilin protein sequences of the invention include the 
specifically disclosed sequences, variants of these sequences 
resulting from alternative mRNA splicing, allelic variants of 
these sequences, and homologous or orthologous variants of these 

15 sequences. Thus, for example, the invention provides amino acid 
sequences from the hPSl protein, the hPS2 protein, the mPSl 
protein, and the DmPS protein. The present invention also 
provides allelic variants and homologous or orthologous proteins 
by providing methods by which such variants may be routinely 

20 obtained. The present invention also specifically provides for 
mutant or disease -causing variants of the presenilins by 
disclosing a number of specific mutant sequences and by providing 
methods by which other such variants may be routinely obtained . 
Because the proteins of the invention may be used in a variety of 

25 diagnostic, therapeutic and recombinant applications, various 
subsets of the presenilin protein sequences and combinations of 
the presenilin protein sequences with heterologous sequences are 
also provided. For example, for use as immunogens or in binding 
assays, subsets of the presenilin protein sequences, including 

30 both normal and mutant sequences, are provided. Such protein 

sequences may comprise a small number of consecutive amino acid 
residues from the sequences which are disclosed or otherwise 
enabled herein but preferably include at least 4-8, and 
preferably at least 9-15 consecutive amino acid residues from a 

35 presenilin sequence. Other preferred subsets of the presenilin 
protein sequences include those corresponding to one or more of 
the functional domains or antigenic determinants of the 
presenilin proteins and, in particular, may include either normal 
(wild- type) or mutant sequences. The invention also provides for 

40 various protein constructs in which presenilin sequences, either 
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complete or subsets, are joined to exogenous sequences to form 
fusion proteins and the like. In accordance with these 
embodiments, the present invention also provides for methods of 
producing all of the above described proteins which comprise, or 
5 are derived from, the presenilins. 

In another series of embodiments, the present invention 
provides for the production and use of polyclonal and monoclonal 
antibodies, including soitibody fragments, including Fah 
fragments, ^{aLb*)^, and single chain antibody fragments, which 

10 selectively bind to the presenilins, or to specific antigenic 

determinants of the presenilins. The antibodies may be raised in 
mouse, rabbit, goat or other suitable animals, or may be produced 
recombinantly in cultured cells such as hybridoma cell lines. 
Preferably, the antibodies are raised again presenilin sequences 

15 comprising at least 4-8, and preferably at least 9-15 consecutive 
amino acid residues from a presenilin sequence. The antibodies 
of the invention may be used in the various diagnostic, 
therapeutic and technical applications described herein. 

In another series of embodiments, the present invention 

20 provides methods of screening or identifying proteins, small 
molecules or other compounds which are capcQ>le of inducing or 
inhibiting the expression of the presenilin genes and proteins 
(e.g., PSl or PS2) . The assays may be performed in vitro using 
non- transformed cells, immortalized cell lines, or recombinant 

25 cell lines, or in vivo using the transgenic animal models enabled 
herein. In particular, the assays may detect the presence of 
increased or decreased expression of PSl, PS2 or other 
presenilin-related genes or proteins on the basis of increased or 
decreased mRNA expression, increased or decreased levels of 

30 presenilin-related protein products, or increased or decreased 
levels of expression of a marker gene (e.g., /?-galactosidase, 
green fluorescent protein, alkaline phosphatase or lucif erase) 
operably joined to a presenilin 5' regulatory region in a 
recombinant construct. Cells known to express a particular 

35 presenilin, or transformed to express a particular presenilin, 
are incubated and one or more test compounds are added to the 
medium. After allowing a sufficient period of time (e.g., 0-72 
hours) for the compound to induce or inhibit the expression of 
the presenilin, any change in levels of expression from an 

40 established baseline may be detected using any of the techniques 
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described above. In particularly preferred embodiments, the 
cells are from an immortalized cell line such as a human 
neuroblastoma, glioblastoma or a hybridoma cell line, or are 
transformed cells of the invention. 
5 In another series of embodiments, the present invention 

provides methods for identifying proteins and other compoxmds 
which bind to, or otherwise directly interact with, the 
presenilins. The proteins and compounds will include endogenous 
cellular components which interact with the presenilins in vivo 

10 and which, therefore, provide new targets for pharmaceutical and 
therapeutic interventions, as well as recombinant, synthetic and 
otherwise exogenous compounds which may have presenilin binding 
capacity and, therefore, may be candidates for pharmaceutical 
agents. Thus, in one series of embodiments, cell lysates or 

15 tissue homogenates (e.g., human brain homogenates, lymphocyte 
lysates) may be screened for proteins or other compounds which 
bind to one of the normal or mutant presenilins. Alternatively, 
any of a variety of exogenous compounds, both naturally occurring 
and/or synthetic (e.g., libraries of small molecules or 

20 peptides), may be screened for presenilin binding capacity. In 
each of these embodiments, an assay is conducted to detect 
binding between a "presenilin component" and some other moiety. 
The "presenilin component" in these assays may be any polypeptide 
comprising or derived from a normal or mutant presenilin protein, 

25 including fxinctional domains or antigenic determinants of the 
presenilins, or presenilin fusion proteins. Binding may be 
detected by non-specific measures (e.g., changes in intracellular 
Ca^*, GTP/GDP ratio) or by specific measures {e.g., changes in A/S 
peptide production or changes in the expression of other 

3 0 downstream genes which can be monitored by differential display, 
2D gel electrophoresis, differential ixybridization, or SAGE 
methods) . The preferred methods involve variations on the 
following techniques: (1) direct extraction by affinity 
chromatography; (2.) co-isolation of presenilin components and 

35 bound proteins or other compovmds by immunoprecipitation; (3) 

the Biomolecular Interaction Assay (BIAcore) ; and (4) the yeast 
two-hybrid systems. 

In another series of embodiments, the present invention 
provides for methods of identifying proteins, small molecules and 

40 other con^oimds capable of modulating the activity of normal or 
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mutant presenilins. Using normal cells or animals, the 
transformed cells and transgenic animal models of the present 
invention, or cells obtained from subjects bearing normal or 
mutant presenilin genes, the present invention provides methods 
5 of identifying such confounds on the basis of their ability to 
affect the expression of the presenilins, the intracellular 
localization of the presenilins, intracellular Ca'*, Na*, K* or 
other ion levels or metabolism, the occurrence or rate of 
apoptosis or cell death, the levels or pattern of A/3 peptide 

10 production, the presence or levels of phosphorylation of 
microtubule associated proteins, or other biochemical, 
histological, or physiological markers which distinguish cells 
bearing normal and mutant presenilin sequences. Using the 
transgenic animals of the invention, methods of identifying such 

15 compounds are also provided on the basis of the etbility of the 
compounds to affect behavioral, physiological or histological 
phenotypes associated with mutations in the presenilins. 

In another series of embodiments, the present invention 
provides methods for screening for carriers of presenilin alleles 

20 associated with AD, for diagnosis of victims of AD, and for the 
screening and diagnosis of related presenile and senile 
dementias, psychiatric diseases such as schizophrenia and 
depression, and neurologic diseases such as stroke and cerebral 
hemorrhage, which associated with mutations in the PSl or PS2 

25 genes. Screening and/or diagnosis can be accomplished by methods 
based upon the nucleic acids (including genomic and mRNA/cDNA 
sequences), proteins, and/or cuitibodies disclosed and enea>led 
herein, including fuhctiohal assays designed to detect failure or 
augmentation of the normal presenilin activity and/or the 

30 presence of specific new activities conferred by the mutant 

presenilins. Thus, screens and diagnostics based upon presenilin 
proteins are provided which detect differences between mutant and 
normal presenilins in electrophoretic mobility, in proteolytic 
cleavage patterns, in molar ratios of the various amino acid 

35 residues, in ability to bind specific antibodies. In addition, 
screens and diagnostics based upon nucleic acids (gDNA, cDNA or 
mRNA) are provided which detect differences in nucleotide 
sequences by direct nucleotide sequencing, hybridization using 
allele specific oligonucleotides, restriction enzyme digest and 

40 mapping (e.g., RFLP. REF-SSCP) , electrophoretic mobility (e.g.. 
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deposits of particular nucleic acids disclosed herein have made 
with the ATCC (Rockville, MD) . These deposits include Accession 
Number 97124 (deposited April 28, 1995), Accession Number 97508 
(deposited on April 28, 1995), Accession Number 97214 (deposited 
5 on June 28, 1995), and Accession Number 97428 (deposited January 
26, 1996). 

Brief Descriptio n of the DrawincfA 
Figure 1: This figure is a representation of the structural 
organization of the hPSl genomic DNA. Non-coding exons are 

10 depicted by solid shaded boxes. Coding exons are depicted by 

open boxes or hatched boxes for alternatively spliced sequences. 
Restriction sites are: B = BamHI; E = EcoRI; H = Hindlll; N = 
Not I; P = Psti; V = Pvull; X = Xbal. Discontinuities in the 
horizontal line between restriction sites represent \mdefined 

15 genomic sequences. Cloned genomic fragments containing each exon 
are depicted by double-ended horizontal arrows. The size of the 
genomic subclones and Accession number for each genomic sequence 
are provided. 

Figure 2: This figure is a representation of a hydropathy 

20 plot of the putative PSl protein. The plot was calculated 
according to the method of Kyte and Doolittle (1982) . 

Figure 3: This figure presents a sequence alignment of the 
hPSl and mPSl protein sequences. Vertical bars indicate 
identical amino acids. 

25 Figure 4: This figure presents a sequence alignment of the 

hPSl and hPS2 protein sequences. Vertical bars indicate 
identical amino acids. 

Figure 5: This figure is a schematic drawing of the 
predicted structure of the PSl protein. Roman numerals depict 

30 the transmembrane domains. Putative glycosylation sites are 

indicated as asterisks and most of the phosphorylation sites are 
located on the same membrane face as the two acidic hydrophilic 
loops. The MAP kinase site is present at residue 115 and the PKC 
site at residue 114. FAD mutation sites are indicated by 

35 horizontal arrows. 

Figure 6: This figure is a schematic drawing of the 
predicted structure of the PS2 protein. Roman numerals depict the 
transmembrane domains. Putative glycosylation sites are 
indicated as asterisks and most of the phosphorylation sites are 

40 located on the same membrane face as the two acidic hydrophilic 
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loops. FAD mutation sites are indicated by horizontal arrows. 

D^t;ailQd Description of the Inv^T1^nr.r, 
^ • Definitions 

In order to facilitate review of the various embodiments of 
5 the invention, and an understanding of the various elements and 
constituents used in making and using the invention, the 
following definitions are provided for particular terms used in 
the description and appended claims: 

Py^senilin. As used without further modification herein, the 
10 terms 6presenilin6 or 6presenilins6 mean the presenilin-i (PSl) 
and/or the presenilin-2 <PS2) genes /proteins, in particular, the 
unmodified terms 6presenilin6 or 6presenilins6 refer to the 
mammalian PSl and/or PS2 genes/proteins and, preferably, the 
human PSl and/or PS2 genes/proteins. 

?ygg?^A;iin-J, qCT^T as used herein, the term "presenilin-i gene" 
or -PSl gene*^ means the mammalian gene first disclosed and 
described in U.S. Application Ser. No. 08/431,048, filed on April 
28, 1995, and later described in Sherrington et al. (1995), 
including any allelic variants and heterospecif ic mammalian 
20 homologues. One human presenilin-l (hPSl) cDNA sequence is 

disclosed herein as SEQ ID NO: 1. Another human cDNA sequence, 
resulting from alternative splicing of the hPSl mRNA transcript, 
is disclosed as SEQ ID NO: 3. Additional human splice variants,' 
as described below, have also been found in which a region 
25 encoding thirty-three residues may be spliced-out in some 

transcripts. A cDNA of the murine homologue (mPSl) is disclosed 
as SEQ ID NO: 16. The term "presenilin-1 gene" or "PSl gene" 
primarily relates to a coding sequence, but can also include some 
or all of the flanking regulatory regions and/or introns. The 
30 term PSl gene specifically includes artificial or recombinant 
genes created from cDNA or genomic DNA, including recombinant 
genes based upon splice variants- The presenilin-1 gene has also 
been referred to as the S182 gene (e.g., Sherrington et al., 
1995) or as the Alzheimer's Related Membrane Protein (ARMP) gene 
35 (e.g., U.S. Application Ser. No. 08/431,048, filed on April 28, 
1995) . 

py^gQnj.3,i^■l, pyot;ei,^, As used herein, the term "presenilin-1 
protein" or "PSl protein" means a protein encoded by a PSl gene, 
including allelic variants and heterospecif ic mammalian 
40 homologues. One human presenilin-1 (hPSl) protein sequence is 
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SSCP, DGGE) , PGR mapping, RNase protection, chemical mismatch 
cleavage, ligase -mediated detection, and various other methods. 
Other methods are also provided which detect abnormal processing 
of PSl, PS2, APP, or proteins reacting with PSl, PS2, or APP 
5 (e.g., abnormal phosphorylation, glycosylation, glycation 
amidation or proteolytic cleavage) alterations in presenilin 
transcription, translation, and post- trans lational modification; 
alterations in the intracellular and extracellular trafficking of 
presenilin gene products; or abnormal intracellular localization 

10 of the presenilins. In accordance with these embodiments, 

diagnostic kits are also provided which will include the reagents 
necessary for the above-described diagnostic screens. 

In another series of embodiments, the present invention 
provides methods and pharmaceutical preparations for use in the 

15 treatment of presenilin-associated diseases such as AD. These 
methods and pharmaceuticals are be based upon (1) administration 
of normal PSl or PS2 proteins, (2) gene therapy with normal PSl 
or PS2 genes to compensate for or replace the mutant genes, (3) 
gene therapy based upon antisense sequences to mutant PSl or PS2 

20 genes or which "knock-out" the mutant genes, (4) gene therapy 
based upon sequences which encode a protein which blocks or 
corrects the deleterious effects of PSl or PS2 mutants, (5) 
immunotherapy based upon antibodies to normal and/or mutant PSl 
or PS2 proteins, or (6) small molecules (drugs) which alter PSl 

25 or PS2 expression, block cODnormal interactions between mutant 
forms of PSl or PS2 and other proteins or ligands, or which 
otherwise block the aberrant function of mutant PSl or PS2 
proteins by altering the structure of the mutant proteins, by 
enhancing their metabolic clearance, or by inhibiting their 

30 function. 

In accordance with another aspect of the invention, the 
proteins of the invention caui be used as starting points for 
rational drug design to provide ligands, therapeutic drugs or 
other types of small chemical molecules. Alternatively, small 
35 molecules or other compounds identified by the above -described 
screening assays may serve as ^lead compounds" in rational drug 
design. 

Particularly disclosed nucleotide and amino acid sequences 
of the present invention are numbered SEQ ID NOs: 1-25. In 
40 addition, under the terms of the Budapest Treaty, biological 
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disclosed herein as SEQ ID NO: 2. Another human PSl protein 
sequence, resulting from alternative splicing of the hPSl mRNA 
transcript, is disclosed as SEQ ID NO: 4. Additional human 
splice variants, as described below, have also been found in 
5 which a region including thirty- three residues may be spliced- out 
in some transcripts. These variants are also embraced by the 
term presenilin-l protein as used herein. A protein sequence of 
the murine homologue (mPSl) is disclosed as SEQ ID NO: 17. The 
protein may be produced by recombinant cells or organisms, may be 

10 substantially purified from natural tissues or cell lines, or may 
be synthesized chemically or enzymatically. Therefore, the term 
"presenilin-l protein" or "PSl protein" is intended to include 
the protein in glycosylated, partially glycosylated, or 
unglycosylated forms, as well as in phosphorylated, partially 

15 phosphorylated, unphosphorylated, sulphated, partially sulphated, 
or unsulphated forms. The term also includes allelic variants 
and other fxmctional equivalents of the PSl amino acid sequence, 
including biologically active proteolytic or other fragments. 
This protein has also been referred to as the S182 protein (e.g., 

20 Sherrington et al., 1995) or as the Alzheimer's Related Membrane 
Protein (ARMP) (e.g., U.S. Application Ser. No. 08/431,048, filed 
on April 2B, 1995) . 

hPSl gene and/or protein. As used herein, the abbreviation 
"hPSl" refers to the human homologue and human allelic variants 

25 of the PSl gene and/or protein. Two cDNA sequences of the human 
PSl gene are disclosed herein as SEQ ID NO: l and SEQ ID NO: 3. 
The corresponding hPSl protein sequences are disclosed herein as 
SEQ ID NO: 2 and SEQ ID NO: 4. Numerous allelic variants, 
including deleterious mutants, are disclosed and enabled 

30 throughout the description which follows. 

mPSl gene and/or protein. As used herein, the abbreviation 
"mPSl" refers to the murine homologues and murine allelic 
variants of the PSl gene and/or protein. A cDNA sequence of one 
murine PSl gene is disclosed herein as SEQ ID NO: 16. The 

35 corresponding mPSl protein sequence is disclosed herein as SEQ ID 
NO: 17. Allelic variants, including deleterious mutants, are 
enabled in the description which follows. 

Presenilin-2 gene. As used herein, the term »presenilin-2 gene" 
or «'PS2 gene" means the mammalian gene first disclosed and 
40 described in U.S. Application Ser. No. 08/496,841, filed on June 
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28, 1995, and later described in Rogaev et al. (1995) and Levy- 
Lahad et al. (1995), including any allelic variants and 
heterospecif ic mairanalian homologues. One human presenilin-2 
(hPS2) cDNA sequence is disclosed herein as SEQ ID NO: 18. 
5 Additional human splice variants, as described below, have also 
been £ound in which a single codon or a region encoding thirty- 
three residues may be spliced-out in some transcripts. The term 
'•presenilin-2 gene" or "PS2 gene" primarily relates to a coding 
sequence, but can also include some or all o£ the flanking 

10 regulatory regions and/or introns. The term PS2 gene 

specifically includes artificial or recombinant genes created 
from cDNA or genomic DNA, including recombinant genes based upon 
splice variants. The presenilin-2 gene has also been referred to 
as the E5-1 gene {e.g., Rogaev et al . , 1995; U.S. Application 

15 Ser. No. 08/496,841, filed on June 28, 1995) or the STM2 gene 
(e.g., Levy-Lahad et al., 1995). 

pyeseRj.?.i^-2 pyo^^i,^, As used herein, the term "presenilin-2 
protein" or "PS2 protein" means a protein encoded by a PS2 gene, 
including allelic variants and heterospecif ic mammalian 

20 homologues. One human pre8enilin-2 (hPS2) protein sequence is 
disclosed herein as SEQ ID NO: 19. Additional human splice 
variants, as described below, have also been found in which a 
single residue or a region including thirty- three residues may be 
spliced-out in some tramscripts . These variants are also 

25 embraced by the term presenilin-2 protein as used herein. The 

protein may be produced by recombinant cells or organisms, may be 
substantially purified from natural tissues or cell lines, or may 
be synthesized chemically or enzymatically . Therefore, the term 
"presenilin-2 protein" or "PS2 protein" is intended to include 

30 the protein in glycosylated, partially glycosylated, or 

unglycosylated forms, as well as in phosphorylated, partially 
phosphorylated, unphosphorylated, sulphated, partially sulphated, 
or unsulphated forms. The term also includes allelic variants 
and other functional equivalents of the PS2 amino acid sequence, 

35 including biologically active proteolytic or other fragments. 

This protein has also been referred to as the E5-1 protein (e.g., 
Sherrington et al., 1995; U.S. Application Ser. No. 08/496,841, 
filed on June 28, 1995} or the STM2 protein (e.g., Levy-Lahad et 
al., 1995) . 

40 hPS2 gene and/or protein. As used herein, the abbreviation 
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"hPS2" refers to the human homologue and human allelic variants 
of the PS2 gene and/or protein. One cDNA sequences of the human 
PS2 gene is disclosed herein as SEQ ID NO: 18. The corresponding 
hPS2 protein sequence is disclosed herein as SEQ ID NO: 19. 
5 Numerous allelic variants, including deleterious mutants, are 
disclosed and ensdDled throughout the description which follows. 
DmPS gene and/or protein. As used herein, the abbreviation 
"DmPS« refers to the Droscphila homologues and allelic variants 
of the PSl and PS2 genes/proteins. This definition is xinderstood 

10 to include nucleic acid and amino acid sequence polymorphisms 
wherein substitutions, insertions or deletions in the gene or 
protein sequence do not affect the essential function of the gene 
product. The nucleotide sequence of one cDNA of the DmPS gene is 
disclosed herein as SEQ ID NO: 20 and the corresponding amino 

15 acid sequence is disclosed as SEQ ID NO: 21. The term "DmPS 

gene" primarily relates to a coding sequence but can also include 
some or all of the flsmking regulatory regions and/or introns, 
ypym^l . As used herein with respect to genes, the term 6normal6 
refers to a gene which encodes a normal protein. As used herein 

20 with respect to proteins, the term dnormalfi means a protein 

which performs its usual or normal physiological role and which 
is not associated with, or causative of, a pathogenic condition 
or state. Therefore, as used herein, the term dnormal6 is 
essentially synonymous with the usual meaning of the phrase dwild 

25 type. 6 For any given gene, or corresponding protein, a 

multiplicity of normal allelic variants may exist, none of which 
is associated' with the development of a pathogenic condition or 
state. Such normal allelic variants include, but are not limited 
to, variants in which one or more nucleotide sxabsti tut ions do not 

30 result in a change in the encoded amino acid sequence. 

Mutant . As used herein with respect to genes, the term dmutanto 
refers to a gene which encodes a mutant protein. As used herein 
with respect to proteins, the term dmutantd means a protein which 
does not perform its usual or normal physiological role and which 

35 is associated with, or causative of, a pathogenic condition or 
state. Therefore, as used herein, the term dmutantd is 
essentially synonymous with the terms ddysfunctional,6 
dpathogenicd ddi8ease*causing,6 and ddeleterious . 6 With respect 
to the presenilin genes and proteins of the present invention, 

40 the term dmutant6 refers to presenilin genes /proteins bearing one 
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or more nucleotide/amino acid substitutions, insertions and/or 
deletions which typically lead to the development of the syn^toms 
of Alzheimer's Disease euid/or other relevant inheritable 
phenotypes (e.g. cerebral hemorrhage, mental retardation, 
5 schizophrenia, psychosis, and depression) when expressed in 
humans. This definition is understood to include the various 
mutations that naturally exist, including but not limited to 
those disclosed herein, as well as synthetic or recombinant 
mutations produced by human intervention. The term "mutant," as 

10 applied to the presenilin genes, is not intended to embrace 
sequence variants which, due to the degeneracy of the genetic 
code, encode proteins identical to the normal sequences disclosed 
or otherwise enabled herein; nor is it intended to embrace 
sequence variants which, although they encode different proteins, 

15 encode proteins which are functionally equivalent to normal 
presenilin proteins. 

Functional equivalent. As used herein in describing gene 
sequences and amino acid sequences, the term "functional 
equivalent" means that a recited sequence need not be identical 

20 to a particularly disclosed sequence of the SEQ ID NOs but need 
only provide a sequence which functions biologically and/or 
chemically as the equivalent of the disclosed sequence. 
Sxibstantiallv pure. As used herein with respect to proteins 
(including antibodies) or other preparations, the term 

25 "substantially pure" means a preparation. which is at least 60% by 
weight (dry weight) the compound of interest. Preferably the 
preparation is at least 75%, more preferably at least 90%, and 
most preferably at least 99%, by weight the compoixnd of interest. 
Purity can be measured by any appropriate method, e.g., column 

30 chromatography, gel electrophoresis, or HPLC analysis. 

With respect to proteins, including antibodies, if a 
preparation includes two or more different compounds of interest 
(e.g., two or more different antibodies, immunogens, functional 
domains, or other polypeptides of the invention), a 

35 "substantially pure" preparation means a preparation in which the 
total weight (dry weight) of all the compounds of interest is at 
least 60% of the total dry weight. Similarly, for such 
preparations containing two or more compounds of interest, it is 
preferred that the total weight of the compoiinds of interest be 

40 at least 75%, more preferably at least 90%, and most preferably 
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at least 99%, of the total dry weight of the preparation. 
Isolated nuclei c acid. As used herein, an ** isolated nucleic 
acid" is a ribonucleic acid, deoxyribonucleic acid, or nucleic 
acid analog comprising a polynucleotide sequence that has been 
5 isolated or separated from secjuences that are immediately 
contiguous (one on the 5' end and one on the 3' end) in the 
naturally occurring genome of the organism from which it is 
derived. The term therefore includes, for example, a recombinant 
nucleic acid which is incorporated into a vector, into an 

10 autonomously replicating plasmid or virus, or into the genomic 
DNA of a prokaryote or eukaryote; or which exists as a separate 
molecule (e.g., a cDNA or a genomic DNA fragment produced by PGR 
or restriction endonuclease treatment) independent of other 
sequences. It also includes a recombinant DNA which is part of a 

15 hybrid gene encoding additional polypeptide sequences and/or 
including exogenous regulatory elements. 
Substantially identical aemienee. As used herein, a 
"substantially identical" amino acid sequence is an amino acid 
sequence which differs only by conservative amino acid 

20 substitutions, for example, substitution of one amino acid for 

another of the same class (e.g., valine for glycine, arginine for 
lysine, etc.) or by one or more non-conservative substitutions, 
deletions, or insertions located at positions of the amino acid 
sequence which do not destroy the function of the protein 

25 (assayed, e.g,, as described herein). Preferably, such a 
sequence is at least 85%, more preferably 90%, and most 
preferably 95% identical at the amino acid level to the sequence 
of the protein or peptide to which it is being compared. For 
nucleic acids, the length of comparison sequences will generally 

30 be at least 50 nucleotides, preferably at least 6 0 nucleotides, 
more preferably at least 75 nucleotides, and most preferably no 
nucleotides. A "substantially identical" nucleic acid sequence 
codes for a substantially identical amino acid sequence as 
defined above. 

35 Transformed cell. As used herein, a "transformed cell" is a cell 
into which (or into an ancestor of which) has been introduced, by 
means of recombinsuit DNA techniques, a nucleic acid molecule of 
interest. The nucleic acid of interest will typically encode a 
peptide or protein. The trauisformed cell may express the 

40 sequence of interest or may be used only to propagate the 
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sequence. The term "transformed" may be used herein to embrace 
any method of introducing exogenous nucleic acids including, but 
not limited to, transformation, transfection, electroporation, 
microinjection, viral -mediated transf action, and the like. 
5 Operablv joined. As used herein, a coding sequence and a 

regulatory region are said to be "operably joined" when they are 
covalently linked in such a way as to place the expression or 
transcription of the coding sequence under the influence or 
control of the regulatory region. If it is desired that the 

10 coding sequences be translated into a functional protein, two DNA 
sequences are said to be operably joined if induction of promoter 
function results in the transcription of the. coding sequence and 
if the nature of the linkage between the two DNA sequences does 
not (1) result in the introduction of a frame-shift mutation, (2) 

15 interfere with the ability of the regulatory region to direct the 
transcription of the coding sequences, or (3) interfere with the 
ability of the corresponding RNA transcript to be translated into 
a protein. Thus, a regulatory region would be operably joined to 
a coding sequence if the regulatory region were capable of 

20 effecting transcription of that DNA sequence such that the 

resulting transcript might be translated into the desired protein 
or polypeptide. 

Stringent hybridization c onditions. Stringent hybridization 
conditions is a term of art understood by those of ordinary skill 

5 in the art. For any given nucleic acid sequence, stringent 
hybridization conditions are those conditions of temperature, 
chaotrophic acids, buffer, and ionic strength which will permit 
hybridization- of that nucleic acid sequence to its complementary 
sequence and not to substantially different sequences. The exact 

0 conditions which constitute "stringent" conditions, depend upon 
the natxxre of the nucleic acid sequence, the length of the 
sequence, and the frequency of occurrence of subsets of that 
sequence within other non-identical sequences. By varying 
hybridization conditions from a level of stringency at which non- 

5 specific hybridization occurs to a level at which only specific 
hybridization is observed, one of ordinary skill in the art can, 
without undue experimentation, determine conditions which will 
allow a given sequence to hybridize only with complementary 
sec[uences. Suitable ranges of such stringency conditions are 

0 described in Krause and Aaronson (1991) . Hybridization 
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conditions, depending upon the length and commonality of a 
sequence, may include temperatures of 20°C-65»C and ionic 
strengths from 5x to O.lx SSC. Highly stringent hybridization 
conditions may include temperatures as low as 40-42 ®C (when 
5 denaturants such as formamide are included) or up to 6O-6SOC in 
ionic strengths as low as O.lx SSC. These ranges, however, are 
only illustrative and, depending upon the nature of the target 
sequence, and possible future technological developments, may be 
more stringent than necessary. Less than stringent conditions 
10 are employed to isolate nucleic acid sequences which are 
substantially similar, allelic or homologous to any given 
sequence . 

Selectively binds. As used herein with respect to antibodies, an 
antibody is said to "selectively bind" to a target if the 

15 antibody recognizes euid binds the target of interest but does not 
substantially recognize and bind other molecules in a sample, 
e,g., a biological san^le, which includes the target of Interest. 
II. T^Q pyggen^lins 

The present invention is based, in part, upon the discovery 

20 of a family of mammalian genes which, when mutated, are 

associated with the development of AlzheimerOs Disease. The 
discovery of these genes, designated presenilin-l and presenilin- 
2, as well as the characterization of these genes, their protein 
products, mutamts, and possible functional roles, are described 

25 below. Invertebrate homologues of the presenilins are also 
discussed as they may shed light on the function of the 
presenilins and to the extent they may be useful in the various 
embodiments described below. 
1. Isolation of the Human Presenilin-l Gene 

30 A. Genetic Mapping of the AD3 Region 

The initial isolation and characterization of the PSl gene, 
then referred to as the AD3 gene or S182 gene, was described in 
Sherrington et al (1995) . After the initial regional mapping of 
the AD3 gene locus to I4q24.3 near the auionymous microsatellite 

35 markers D14S43 and D14S53 (Schellenberg et al., 1992; St. George- 
Hyslop et al., 1992; Van Broeclchoven et al., 1992), twenty one 
pedigrees were used to segregate AD as a putative autosomal 
dominant trait (St. George-Hyslop et al«, 1992) and to 
investigate the segregation of 18 additional genetic markers from 

40 the 14q24.3 region which had been organized into a high density 
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genetic linkage map (Weissenbach et al., 1992; Gyapay et al., 
1994) . Previously published pairwise maximum likelihood analyses 
confirmed substantial cumulative evidence for linkage between 
familial Alzheimer's Disease (FAD) and all of these markers. 
5 However* much of the genetic data supporting linkage to these 
markers were derived from six large early onset pedigrees, FADl 
(Nee et al., 1983), FAD2 (Frommelt et al., 1991), FAD3 (Goudsmit 
et al., 1981; Pollen, 1993), FAD4 (Foncin et al . , 1985), TORI . 1 
(Bergamini, 1991) and 603 (Pericak- Vance et al., 1988), each of 

10 which provides at least one anonymous genetic marker from I4q24.3 
(St. George-Hyslop et al., 1992). 

In order to define more precisely the location of the AD3 
gene relative to the known locations of the genetic markers from 
14q24.3, recombinational landmarks were sought by direct 

15 inspection of the raw haplotype data from those geno typed 

affected members of the six pedigrees showing definitive linkage 
to chromosome 14 . This selective strategy in this particular 
instance necessarily discards data from the reconstructed 
genotypes of deceased affected members as well as from elderly 

20 asymptomatic members of the large pedigrees, and takes no account 
of the smaller pedigrees of uncertain linkage status. However, 
this strategy is very sound because it also avoids the 
acquisition of potentially misleading genotype data acquired 
either through errors in the reconstructed genotypes of deceased 

25 affected members arising from non-paternity or sampling errors or 
from the inclusion of unlinked pedigrees. 

Upon inspection of the haplotype data for affected s\ibjects, 
members of the six large pedigrees whose genotypes were directly 
determined revealed obligate recombinants at D14S48 and D14S53, 

30 and at D14S258 and D14S63. The single recombinant at D14S53, 

which depicts a telomeric boxindary for the FAD region, occurred 
in the same AD affected subject of the FADl pedigree who had 
previously been found to be recombinant at several other markers 
located telomeric to D14S53, including D14S48 (St. George-Hyslop 

35 et al., 1992). Conversely, the single recombinant at D14S258, 

which marks a centromeric boundary of the FAD region, occurred in 
an affected member of the FAD3 pedigree who was also recombinant 
at several other markers centromeric to D14S258 including D14S63 . 
Both recombinant subjects had unequivocal evidence of Alzheimer's 

40 Disease confirmed through stamdard clinical tests for the illness 
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in other affected members of their families, and the genotype of 
both recombinant siibjects was informative and co- segregating at 
multiple loci within the interval centromeric to D14S53 and 
telomeric to D14S258. 
5 When the haplotype analyses were enlarged to include the 

reconstructed genotypes of deceased affected members of the six 
large pedigrees as well as data from the remaining fifteen 
pedigrees with probabilities for linkage of less than 0.95, 
several additional recombinants were detected at one or more 

10 marker loci within the interval between D14S53 and D14S258. 
Thus, one additional recombinant was detected in the 
reconstructed genotype of a deceased affected member of each of 
three of the larger FAD pedigrees (FADl, FAD2 and other related 
families) , and eight additional recombinants were detected in 

15 affected members of five smaller FAD pedigrees. However, while 
some of these recombinants might have correctly placed the AD3 
. gene within a more defined target region, it was necessary to 
regard these potentially closer "internal recombinants" as 
unreliable not only for the reasons discussed earlier, but also 

20 because they provided mutually inconsistent locations for the AD3 
gene within the D14S53-D14S258 interval. 

B. Construction of a Physical Contia S panning the AD3 Region 

As an initial step towards cloning the AD3 gene, a contig of 
overlapping genomic DNA fragments cloned into yeast artificial 

25 chromosome vectors, phage artificial chromosome vectors and 
cosmid vectors was constructed. FISH mapping studies using 
cosmids derived from the YAC clones 932c7 and 964f5 suggested 
that the interval most likely to carry the AD3 gene was at least 
five megabases in size. Because the large size of this minimal 

30 CO- segregating region would make positional cloning strategies 
intractable, additional genetic pointers were sought which 
focused the search for the AD3 gene to one or more subregions 
within the interval flanked by D14S53 and D14S258. Haplotype 
analyses at the markers between D14S53 and D14S258 failed to 

35 detect statistically significant evidence for linkage 

disequilibrium and/or allelic association between the FAD trait 
and alleles at any of these markers, irrespective of whether the 
analyses were restricted to those pedigrees with early onset 
forms of FAD, or were generalized to include all pedigrees. This 

40 result was not unexpected given the diverse ethnic origins of our 
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pedigrees. However, when pedigrees of similar ethnic descent 
were collated, direct inspection of the haplotypes observed on 
the disease-bearing chromosome segregating in different pedigrees 
of similar ethnic origin revealed two clusters of marker loci. 
5 The first of these clusters located centromeric to D14S77 

(D14S786. D14S277 and D14S268; and spanned the 0.95 Mb physical 
interval contained in YAC 78842. The second cluster was located 
telomeric to D14S77 (D14S43, D14S273, and D14S76) and spanned the 
- 1Mb physical interval included within the overlapping YAC 
10 clones 964c2, 74163, 7.97dll and part of 854f5. Identical alleles 
were observed in at least two pedigrees from the same ethnic 
origin. As part the strategy, it was reasoned that the presence 
of shared alleles at one of these groups of physically clustered 
marker loci might reflect the co- inheritance of a small physical 
15 region surrounding the PSl gene on the original founder 

chromosome in each ethnic population. Significantly, each of the 
shared extended haplotypes were rare in normal Caucasian 
populations and allele sharing was not observed at other groups 
of markers spanning similar genetic intervals elsewhere on 
20 chromosome 14q24.3. 

C. Transcription Mapping and Anal vs is o f Candidate Genes 

To isolate expressed sequences encoded within both critical 
intervals, a direct selection strategy was used involving 
immobilized, cloned, human genomic DNA as the hybridization 
25 target to recover transcribed sequences from primary 

complementary DNA pools derived from human brain mRNA (Rommens et 
al., 1993). Approximately 900 putative cDNA fragments of size 
100 to 600 base pairs were recovered from these regions. These 
fragments were hybridized to Southern blots containing genomic 
30 ONAs from each of the overlapping YAC clones and genomic DNAs 
from humans and other mammals. This identified a subset of 151 
clones which showed evidence for evolutionary conservation and/or 
for a complex structure which suggested that they were derived 
from spliced mRNA. The clones within this subset were collated 
35 on the basis of physical map location, cross-hybridization and 
nucleotide sequence, and were used to screen conventional humsui 
brain cDNA libraries for longer cDNAs. At least 19 independent 
cDNA clones over 1 kb in length were isolated and then aligned 
into a partial treuiscription map of the AD3 region. Only three 
40 of these transcripts corresponded to knom characterized genes 
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(cFOS, dihydrolipoatnide succinyl transferase, and latent 
transforming growth factor binding protein 2) , 
D. Recovery of Candidate Genes 

Each of the open reading frame portions of the candidate 
5 genes were recovered by RT-PCR from mRNA isolated from post- 
mortem brain tissue of normal control subjects and from either 
post-mortem brain tissue or cultured fibroblast cell lines of 
affected members of six pedigrees definitively linked to 
chromosome 14, The RT-PCR products were then screened for 
10 sequence differences using chemical cleavage and restriction 

endonuclease fingerprinting single-strand sequence conformational 
polymorphism methods (Saleeba and Cotton, 1993; Liu and Sommer, 
1995), and by direct nucleotide sequencing. With one exception, 
all of the genes examined, although of interest, did not contain 

15 alterations in sequences that were unique to affected subjects, 
or co-segregated with the disease. The single exception was the 
candidate gene represented by clone S182 which contained a series 
of nucleotide changes not observed in normal subjects, and which 
were predicted to alter the amino acid sequence in affected 

20 subjects. The gene corresponding to this clone has now been 
designated as presenilin-1 (PSl) . Two psi cDNA sequences, 
representing alternative splice variants described below, are 
disclosed herein as SEQ ID NO: 1 and SBQ ID NO: 3. The 
corresponding predicted amino acid sequences are disclosed as SEQ 

25 ID NO: 2 and SEQ ID NO: 4, respectively. Bluescript plasmids 
bearing clones of these cDNAs have been deposited at the ATCC, 
RocJcville, Md., under ATCC Accession Numbers 97124 and 97508 on 
April 28, 1995. Sequences corresponding to SEQ ID NO: 1 and SEQ 
ID NO: 2 have also been deposited in the GenBank database and may 

30 be retrieved through Accession # 42110. 

2. Isolation of the Murine Presenilin-l Gene 

A murine homologue (mPSl) of the human PSl gene was 
recovered by screening a mouse cDNA library with a labelled human 
DNA probe from the hPSl gene. In this manner, a 2 kb partial 

35 transcript (representing the 3' end of the gene) and several RT- 
PCR products representing the 5' end were recovered. Sequencing 
of the consensus cDNA transcript of the murine homologue revealed 
substantial amino acid identity with hPSl. Importantly, as 
detailed below, all of the amino acids that were mutated in the 

40 FAD pedigrees were conserved between the murine homologue and the 
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normal human variant. This conservation of the PSl gene 
indicates that an orthologous gene exists in the mouse (mPSl) , 
and that it is now possible to clone other mammalian homologues 
or orthologues by screening genomic or cDNA libraries using human 
5 PSl probes. Thus, a similar approach will make it possible to 
identify and characterize the PSl gene in other species. The 
nucleic acid sequence of the mPSl clone is disclosed herein as 
SEQ ID NO: 16 and the corresponding amino acid sequence is 
disclosed as SEQ ID NO: 17. Both sequences have been deposited 
10 in the GenBank database and may be retrieved through Accession # 
42177. 

3. Isolation of the Human Presenilin-2 Gene 

A second human gene, now designated presenilin-2 (PS2) , has 
been isolated and demonstrated to share substantial nucleotide 

15 and amino acid homology with the PSl gene. The initial isolation 
of this gene is described in detail in Rogaev et al. (1995). 
Isolation of the human PS2 gene (referred to as ••STM2") by nearly 
identical methods is also reported in Levy-Lahad et al. (1995). 
Briefly, the PS2 gene was identified by using the nucleotide 

20 sequence of the cDNA for PSl to search data bases using the 
BLASTN paradigm of Altschul et al. (1990). Three expressed 
sequence tagged sites (ESTs) identified by Accession #s T03796, 
R14600, and R05907 were located which had substantial homology (p 
< 1.0 e*"^ greater than 97% identity over at least 100 contiguous 

25 base pairs) . - 

Oligonucleotide primers were produced from these sequences 
and used to generate PGR products by reverse transcriptase PGR 
(RT-PCR) . These short RT- PGR products were partially sequenced 
to confirm their identity with the sequences within the data base 

30 and were then used as hybridization probes to screen full-length 
CDNA libraries. Several different cDNAs ranging in size from i 
kb to 2.3 kb were recovered from a cancer cell cDNA library 
(Gaco2) and from a human brain cDNA library (E5-1, 61-1, cc54, 
cc32) . The nucleotide sequence of these clones confirmed that 

35 all were derivatives of the same transcript. 

The gene encoding the transcript, the PS2 gene, mapped to 
human chromosome 1 using hybrid mapping panels to two clusters of 
CEPH Mega YAC clones which have been placed upon a physical 
contig map (YAC clones 750g7, 921dl2 mapped by FISH to lq41; and 

40 YAG clone 787gl2 mapped to Ip36.1-p35). The nucleic acid 
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sequence of the hPS2 clone is disclosed herein as SEQ id NO: 18 
and the corresponding amino acid sequence is disclosed as SEQ ID 
NO: 19. Both sequences have been deposited in the GenBank 
database and may be retrieved through Accession # L44577. The 
5 DNA sequence of the hPS2 clone also has been incorporated into a 
vector and deposited at the ATCC, Roc3cville, MD., under ATCC 
Accession Number 97214 on J\me 28, 1995. 
4. Identification of Homolocrues in C. el eoans and D. 
melanoCTaster 

10 A. SPE-4 of C. eleaans 

Comparison of the nucleic acid and predicted amino acid 
sequences of PSl with available databases tising the BLAST 
alignment paradigms revealed modest amino acid similarity with 
the C. eleqans sperm integral membrane protein SPE-4 (P = l.5e-- 

15 25, 24-37% identity over three groups of at least fifty residues) 
and weaker similarity to portions of several other membrane 
spanning proteins including mammalian chromogranln A and the 
alpha sub\mit of mammalian voltage dependent calcium channels 
(Altschul et al., 1990). Amino-acid sequence similarities across 

20 putative transmembrane domains may occasionally yield alignment 
that simply arises from the limited number of hydrophobic amino 
acids, but there is also extended sequence alignment between PSl 
and SPE-4 at several hydrophilic domains. Both the putative PSl 
protein and SPE-4 are predicted to be of comparable size (467 and 

25 465 residues, respectively) and, as described more fully below, 
to contain at least seven transmembrane domains with a large 
acidic domain preceding the final predicted transmembrauie domain. 
The PSl protein does have~"a longer predicted hydrophilic region 
at the N terminus. 

30 BLASTP alignment analyses also detected significant homology 

between PS2 and the C. eleqans SPE-4 protein (p » 3.5e-26; 
identity « 20-63% over five domaizis of at least 22 residues) , and 
weak homologies to brain sodium channels (alpha III subunit) and 
to the alpha subunit of voltage dependent calcium channels from a 

35 variety of species {p = 0.02; identities 20-28% over two or more 
domains each of at least 35 residues) (Altschul, 1990) . These 
alignments are similar to those described ahove for the PSl gene. 
B. Sel-12 of C, eleqans 

The 461 residue Sel-12 protein from C. eleqans and S182 (SEQ 

40 ID NO: 2) were found to share 48% sequence identity over 460 
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amino acids (Levitan and Greenwald, 1995) . The Sel-12 protein 
also is believed to have multiple transmembrane domains. The 
sel-12 gene (Accession number U35660) was identified by screening 
for suppressors of a lin-12 gain-of -function mutation, and was 
5 cloned by transformation rescue (Levitan and Greenwald, 1995) . 
C. DmPS of D, melanoaaster 

Redundant oligonucleotides coding for highly conserved 
regions of the presenilin/sel 12 proteins were prepared and used 
to identify relevant mRNAs from adult and embryonic 

10 melanoaaster. These mRNAs were sequenced and shown to contain an 
open reading frame with a putative amino acid sequence highly 
homologous to that of the human presenilins. The DmPS cDNA is 
identified as SEQ ID NO: 20, 

This sequence encodes a polypeptide of 541 amino acids (SEQ 

15 ID NO: 21) with about 52% identity to the human presenilins. 

The structure of the D. melanoqaster homologue is similar to 
that of the humaui presenilins with at least seven putative 
transmembrane domains (Kyte-Doolittle hydrophobicity analyses 
using a window of 15 and cut-off of 1.5) . Evidence of at least 

20 one alternative splice form was detected in that clone pdsl3 

contained an ORF of 541 amino acids, whereas clones pds7, pdsl4 
and pdsl lacked nucleotides 1300-1341 inclusive. This 
alternative splicing would result in the alteration of Gly to Ala 
at residue 384 in the putative TM6-»7 loop, and an in- frame fusion 

25 to the Glu residue at codon 399 of the longer ORF, The principal 
differences between the amino acid sequence of the p. 
melanoaaster and human genes were in the N-terminal acid 
hydrophilic domain and in- the acidic hydrophilic portion of the 
TM6->7 loop. The residues surrounding the TM6-»7 loop are 

30 especially conserved (residues 220-313 and 451-524), suggesting 
that these are fimctionally important domains. Sixteen out of 
twenty residues identified to be mutated in human PSl or PS2 and 
giving rise to human FAD are conserved in the D. melanoaaster 
homologue. 

35 The DNA sequence of the DmPS gene as cloned has been 

incorporated into a Bluescript plasmid. This stable vector was 
deposited with the ATCC, Rockville, MD,, under ATCC Accession 
Ntunber 97428 on January 26, 1996. 

5* Characterization of the Human Presenllin Genes 
40 A. hPSl Transcripts and Gene g ^ructure 
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Hybridization of the PSl (S182) clone to northern blots 
identified a transcript expressed widely in many areas of brain 
and peripheral tissues as a major • 2 . 8 kb transcript and a minor 
transcript of - 7.5 kb (see. e.g., Figure 2 in Sherrington et 
5 al., 1995). PSl is expressed fairly uniformly in most regions of 
the brain and in most peripheral tissues except liver, where 
transcription is low. Although the identity of the - 7.5 kb 
tramscript is unclear, two observations suggest that the - 2.8 kb 
transcript represents an active product of the gene. 

10 Hybridization of the PSl clone to northern blots containing mRNA 
from a variety of murine tissues, including brain, identifies 
only a single transcript identical in size to the - 2.8 kb human 
transcript. All of the longer cDNA clones recovered to date 
(2.6-2.8 kb) , which include both 5' and 3' UTRs and which account 

15 for the - 2.8 kb band on the northern blot, have mapped 
exclusively to the same physical region of chromosome 14. 

From these experiments the - 7.5 kb transcript could 
represent either a rare alternatively spliced or polyadenylated 
isoform of the - 2.8 )cb transcript, or could represent another 

20 gene with homology to PSl. A cDNA library from the Caco2 cell 

line which expresses high levels of both PSl and PS2 was screened 
for long transcripts. Two different clones were obtained, GL40 
and B53, Sequencing revealed that both clones contained a 
similar 5'. UTR and an ORF which was identical to that of the 

25 shorter 2.8 kb transcripts in brain. 

Both clones contained an unusually long 3' UTR. This long 
3' UTR represents the use of an alternate polyadenylation site 
approximately 3 kb further downstream. Thiis long 3' UTR contains 
a number of nucleotide sequence motifs which result in 

30 palindromes or stem- loop structures. These structures are 

associated with mRNA stability and also translational efficiency. 
The utility of this observation is that it may be possible to 
create recombinant expression constructs and/or transgenes in 
which the upstream polyadenylation site is ablated, thereby 

35 forcing the use of the downstream polyadenylation site and the 
longer 3' UTR. In certain instances, this may promote the 
stability of selected mRNA species, with preferential translation 
that could be utilized to. alter the balsuice of mutant versiis 
wild- type transcripts in targeted cell lines, or even in vivo in 

40 the brain, either by germ line therapy or by the use of viral 
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vectors such as modified herpes simplex virus vectors as a form 
of gene therapy. 

The hPSl gene spams a genomic interval of at least 60 kb 
within a 200 kb PACl clone RPCI-1 54D12 from the Roswell Park PAC 
5 library and three overlapping cosmid clones 57-HlO, 1-G9; and 24- 
D5 from the Los Alamos Chromosome 14 cosmid library. Transcripts 
of the PSl gene contain RNA from 13 exons which were identified 
by reiterative hybridization of oligonucleotide and partial cDNA 
probes to subcloned restriction fragments of the PAC and cosmid 

10 clones, and by direct nucleotide sequencing of these subclones. 
The 5' UTR is contained within Exons 1-4, with Exons 1 and 2 
representing alternate 5' ends of the transcript. The ORF is 
contained in Exons 4 to 13, with alternative splicing events 
resulting in the absence of part of Exon 4 or all of Exon 9. 

15 Exon 13 also includes the 3' UTR. 

Unless stated otherwise, in the interests of clarity and 
brevity, all references to nucleotide positions in hPSl derived 
nucleotide sequences will employ the base nximbering of SEQ ID NO: 
1 (L42110) , an hPSl cDNA sequence starting with Exon l. In this 

20 cDNA, Exon 1 is spliced directly to Exon 3, which is spliced to 
Exons 4-13. In SEQ ID NO: 1, Exon 1 spans nucleotide positions 1 
to 113, Exon 3 spans positions 114 to 195, Exon 4 spans positions 
196 to 335, Exon 5 spans positions 336 to 586, Exon 6 spans 
positions 587 to 728, Exon 7 spans positions 729 to 796, Exon 8 

25 spans positions 797 to 1017, Exon 9 spans positions 1018 to 1116, 
Exon 10 spans positions 1117 to 1203, Exon 11 spans positions 
1204 to 1377, Exon 12 spans positions 1378 to 1496, Exon 13 spans 
positions 1497 to 2765. Similarly, unless stated otherwise, all 
references to amino acid residue positions in hPSl derived 

30 protein sequences will employ the residue numbering of SEQ ID NO: 
2, the translation product of SEQ ID NO: i. 

Flanking genomic sequences have been obtained for Exons l- 
12, and are presented in SEQ ID NOs: 5-14 (Accession numbers: 
L76518-L76527) . Genomic sequence 5' from Exon 13 has also been 

35 determined and is presented in SEQ ID NO: 15 (Accession number: 
Ii76528) . SEQ ID NOs: 5-14 also include the complete Exon 
sequences. SEQ ID NO: 15, however, does not include the 3' end 
of Exon 13. The genomic sequences corresponding to Exons 1 and 2 
are located approximately 240 bp apart on a 2.6 kb BamHI-Hindlll 

40 fragment, SEQ ID NO: 5. Exons 3 and 4 (which contains the ATG 
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Start codon) are located on a separate 3 kb BamHl fragment. The 
complete sequence of Intron 2 between the BamHI site -850 bp 
dovmstreara of Exon 2 and the BamHI site -600 bp upstream of Exon 
3 has not yet been identified, and was not immediately recovered 
5 by extended PCR using primers from the flanking BamHI sites, 
implying that Intron 2 may be large. 

Analysis of the nucleotide sequence surrounding Exons l and 
2 (SEQ ID NO: 5) revealed numerous CpG dinucleotides including a 
NotI restriction site in Intron 1. Consensus sequences for 

10 several putative transcriptional regulatory proteins including 
multiple clusters of Activator Protein-2 (AP-2) , Signal 
Transducers and Activators of Transcription (STAT3) (Schindler 
and Darnell, 1995) , Gamma Activator Secpiences (GAS or STATl) , 
Multiple start site Element Downstream (MED) (Ince and Scotto, 

15 1995), and GC elements were present in both Intron 1 and in the 
sequence 5' from Exon 1 (see SEQ id NO: 5). Two putative TATA 
boxes exist upstream of Exon 1, at bp 925-933 and 978-987 of SEQ 
ID NO: 5, and are followed by two putative transcription 
initiation (CAP or Chambon-Trif onov) consensus secpaences at 1002- 

20 1007 bp and 1038-1043 bp 484 of SEQ ID NO: 5. In contrast, the 
sequences immediately upstream of Exon 2 lack TATA boxes or CAP 
sites, but are enriched in clusters of CpG islands. 

A schematic map of the structural organization of the hPSl 
gene is presented as Figure 1. Non-coding exons are depicted by 

25 solid shaded boxes. Coding exons are depicted by open boxes or 
hatched boxes for alternatively spliced sequences. Restriction 
sites are indicated as: B « BamHI,- E » EcoRI; H « Hindlll; N - 
NotI; P a PstI; V » PvuII; x » Xbal. Discontinuities in the 
horizontal line between restriction sites represent undefined 

30 genomic sequences. Cloned genomic fragments containing each exon 
are depicted by double-ended horizontal arrows. The size of the 
genomic subclones and Accession number for each genomic sequence 
are also provided. 

Predictions of DNA secondary structure based upon the 

35 nucleotide sequence within 290 bp upstream of Exon 1 and within 
Intron 1 reveal several palindromes with stability greater than - 
16 kcal/mol. These secondary structure suialyses also predict the 
presence of three stable stem- loop motifs (at bp 1119-1129/1214- 
1224; at bp 1387-1394/1462-1469; and at bp 1422-1429/1508-1515; 

40 all in SEQ ID NO: 5} with a loop size sufficient to encircle a 
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xiucleosome (-'76 bp) « Such stem loop structures are a common 
feature of TATA containing genes (Kollmar and Famham, 1993} . 

A summary of the features in these 5' regions is presented 
in Table 1. All references to base positions are relative to SEQ 
5 ID NO: 5. 

The~ longest predicted open reading frame in SEQ 10 NO: 1 
encodes a protein of 467 amino acids, SEQ ID NO: 2. The start 
codon for this open reading frame is the first in-phase ATG 
located downstream of a TGA stop codon. There are no classical 
10 Kozak consensus sequences around the first two in phase ATG 
codons (Sherrington et al., 1995). Like other genes lacking 
classical ^strong' start codons, the putative 5' UTR of the human 
transcripts is rich in GC. 

B. Alternative Transcription and Splic ing of the hPSl 5' UTR 

15 Although the first three exons and part of the fourth exon 

contain non- translated sequences, analysis of multiple full 
length cDNA clones isolated from a human hippocampus cDNA library 
(Stratagene, La Jolla CA) and from a colon adenocarcinoma cell 
line (Caco2 from J. Rommens) revealed that in the majority of 

20 clones the initial sequences were derived from Exon 1 and were 
directly spliced to Exon 3 (Accession number L42110, SEQ ID NO: 
1). Less frequently (1 out of 9 clones), the initial transcribed 
sequences were derived from Exon 2 and were spliced onto Exon 3 
(Accession number L76517, SEQ ID NO: 3) . Direct nucleotide 

25 sequencing of at least 40 independent RT-PCR transcripts isolated 
using a primer in Exon 1 failed to identify any clones containing 
both Exon 1 and Exon 2. Finally, inspection of the genomic 
sequence upstream of Exon 2 did not reveal a 3' splice site 
sequence. These observations argue that Exon 2 is a true initial 

30 exon rather than an alternative splice form of treuiscripts 

beginning in Exon 1 or 2ui artifact of cDNA cloning. Furthermore, 
since a clone (cc44) containing Exon 2 was obtained from the same 
monoclonal Caco2 cell lines, it is likely that both Exon-1- 
containing transcripts and Exon* 2 -containing treuiscripts exist in 

35 the same cells. 

To test the predictions about transcription initiation sites 
based upon the nucleotide sequence of the 5' upstream region near 
Exon 1, we examined the 5' end sequence of three independent 
"full-length** cDNA clones containing Exon 1 (cc33, cc58 and cc48) 

40 and three sequences recovered by primer extension using an 
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antisense primer located in Exon 3. The furthest 5' extension 
was seen in the cDNA G40L, which mapped the most proximal 
transcription start site to position 1214 bp in the genomic 
sequence containing Exon 1 SEQ ID NO: 5 (L76518) , and which 
5 therefore corresponds to position -lo of SEQ ID NO: i. Two 
additional clones (cDNA cc48 and 5' RACE product #5) shared a 
common start site at position 1259 bp in the genomic sequence, 
SEQ ID NO: 5, which corresponds to position 34 in SEQ ID NO: 1. 
The two remaining cDNAs, as well as the remaining 5' RACE clones, 

10 began at more distal positions within Exon 1. A 5' RACE clone #8 
began at 1224 bp, equal to position l of SEQ ID NO: l. None of 
these clones therefore extended to the predicted CAP site 
upstream of Exon 1. Due to the low prevalence of transcripts 
containing initial sequences from Exon 2, similar studies of 

15 their start sites were not performed. 

C. Alternative Splic ing of the hPSi ORF 

In addition to transcripts with different initial sequences, 
the smalysis of multiple cDNA clones recovered from a variety of 
libraries also revealed two variations in PSl transcripts which 

20 affect the ORF. 

The first of these is the absence of 12 nucleotides from the 
3' end of Exon 4, nucleotides 324 to 335 of SEQ ID NO: 1. This 
would result from splicing of Exon 4 after nucleotide 323 instead 
of after nucleotide 335. Transcripts resulting from this 

25 alternative splicing of Exon 4 do not encode amino acid residues 
Val26-Arg27-Ser28-Gln29 of SEQ ID NO: 2. Transcripts resulting 
from these two alternative splicing events for Exon 4 were 
detected with approximately equal frequencies in all tissues 
surveyed. It is of note in the clones examined to date that the 

30 murine PSl transcripts do contain only the cDNA sequence for 

Ile26-Arg27-Ser28-Gln29, and that the sequence for the Val-Arg- 
Ser-Gln motif is only partially conserved in human PS2 as Arg48- 
Ser49-Gln50 (Rogaev et al.. 1995). Each of these observations 
suggests that these differences are not critical to proper PSl 

35 functioning. 

The second splicing variation affecting the ORF results in 
the absence of Exon 9, nucleotides 1018 to 1116 in SEQ ID NO: 1. 
Analysis of RT-PCR products derived from mRNA of a variety of 
tissues showed that brain (including neocortical areas typically 

40 affected by AD) and several other tissues (muscle, heart, l\ing, 
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colon) predominantly expressed a single transcript bearing Exon 
9. Leukocytes (but not lymphoblasts) on the other hand, also 
expressed a shorter form lacking Exon 9. Alternative splicing of 
Exon 9 is predicted to change an aspartate residue at position 
5 2S7 in SEQ ID NO: 2 to alanine, eliminate the next 33 residues, 
and result in an in- frame fusion to the rest of the protein 
beginning at the threonine at position 291 encoded in Exon 10. 
D. hPS2 Transcripts 

The genomic DNA including the human PS2 gene has not yet 

10 been fully characterized. Nonetheless, many similarities between 
the PSl and PS2 genes are apparent. The intron/exon boundaries 
of both genes, however, appear to be very similar or identical 
except in the region of the TM6-»7 loop. 

Hybridization of the PS2 cDNA clones to Northern Blots 

15 detected a -2.3 kb mRNA band in many tissues, including regions 
of the brain, as well as a *2.6kb mRNA band in muscle, cardiac 
muscle and pancreas. PS2 is expressed at low levels in most 
regions of the brain except the corpus callosum, where 
transcription is high. In skeletal muscle, cardiac muscle and 

20 pancreas, the PS2 gene is expressed at relatively higher levels 
than in brain and as two different transcripts of -2.3 kb and 
-2.6 kb. Both of the trcuiscripts have sizes clearly 
distinguishable from that of the 2.7 kb PSl transcript, and did 
not cross -hybridize with PSl probes at high stringency. The cDNA 

25 sequence of one hPS2 allele is identified as SEQ ID NO: 18 
(Accession No. L44577) . 

The longest ORF within this PS2 cDNA consensus nucleotide 
sequence predicts a polypeptide containing 448 amino acids (SEQ 
ID NO: 19) numbering from the first in-phase ATG codon, at 

30 positions 366-368 in SEQ ID NO: 18, which was surrounded by a 
Kozak consensus sequence. The stop codon is at positions 1710- 
1712. 

As for PSl, analysis of PS2 RT-PCR products from several 
tissues, including brain and muscle, RNA revealed two alternative 

35 splice variants in which a relatively large segment may be 

spliced out. Thus, at a relatively low frequency, transcripts 
are produced in which nucleotides 1152-1250 of the PS2 
transcript, SEQ ID NO: 18, (encoding residues 263-295, SEQ ID NO: 
19) are alternatively spliced. As discussed below, this splicing 

40 event corresponds closely to the alternative splicing of Exon 9 
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of PSl (Rogaev et al., 1995). 

An additional splice variant of the PS2 cDNA sequence 
lacking the GAA triplet at nucleotide positions 1338-1340 in SEQ 
ID NO: 18 has also been fo\md in all tissues examined. This 
5 alternative splice results in the omission of a Glu residue at 
amino acid position 325. 

6. Structure of the Presenilin Proteina 
A. The Presenilin Protein Family 

The presenilins are now disclosed to be a novel family of 
10 highly conserved integral membrane proteins with a common 
structural motif, common alternative splicing patterns, and 
common mutational regions hot spots which correlate with putative 
structural domains which are present in many invertebrate and 
vertebrate animal cells. Analysis of the predicted amino acid 

15 sequences of the human presenilin genes using the Hopp and Woods 
algorithm suggests that the proteins are multispanning integral 
membrane proteins such as receptors, channel proteins, or 
structural membrane proteins. A Kyte-Doolittle hydropathy plot 
of the putative hPSl protein is depicted in Figure 2. The 

20 hydropathy plot and structural analysis suggest that these 

proteins possess approximately seven hydrophobic transmembrane 
domains (designated TMl through TM7) separated by hydrophilic 
6loops.(5 other models can be predicted to have as few as 5 and 
as many as 10 transmembrane domains depending upon the parameters 

25 used in the prediction algorithm. The presence of seven membrane 
spanning domains, however, is characteristic of several classes 
of G-coupled receptor proteins, but is also observed with other 
proteins (e.g., channel proteins). The absence of a recognizable 
signal peptide and the paucity of glycosylation sites are 

30 noteworthy. 

The amino acid sequences of the hPSl and mPSl proteins are 
compared in Figure 3, and the sequences of the hPSl and hPS2 
proteins are compared in Figure 4. In each figure, identical 
amino acid residues are indicated by vertical bars. The seven 

35 putative transmembrane domains are indicated by horizontal lines 
above or below the secjuences. 

The major differences between members of this family reside 
in the amino acid sequences of the hydrophilic, acidic loop 
domains at the N- terminus and between the putative TM6 and TM7 

40 domains of the presenilin proteins (the TM6-*7 loop) . Most of the 
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residues encoded by hPSl Exon 9, which is alternatively spliced 
in some non-neural tissues, form part of the putative TM6-»7 loop. 
In addition, the corresponding alternative splice variant 
identified in hPS2 appears to encode part of the TM6-»7 loop. The 
5 variable splicing of this hydrophilic loop, and the fact that the 
- amino acid sequence of the loop differs between members of the 
gene family, suggest that this loop is an important functional 
domain of the protein and may confer some specificity to the 
physiologic and pathogenic interactions of the individual 

10 presenilin proteins. Because the N- terminal hydrophilic domain 
shares the same acidic charge as the TM6-»7 hydrophilic acid loop, 
' and in a seven transmembrane domain model is likely to have the 
same orientation with respect to the membrane, and is also 
variable amongst the presenilins, it is very likely that these 

15 two domains share functionality either in a coordinated or 
independent fashion (e.g. the same or different ligands or 
functional properties) . Thus, it is likely that the N-terminus 
is also an iciqportant functional domain of the protein and may 
confer some specificity to the physiologic and pathogenic 

20 interactions of the individual presenilin proteins. 

As detailed below, the pathogenic mutations in PSl and PS2 
cluster around the TMl-»2 loop and TM6-»7 loop domains, further 
suggesting that these domains are the functional domains of these 
proteins. Figures 5 and 6 depict schematic drawings of predicted 

25 structures of the PSl and PS2 proteins, respectively, with the 

known mutational sites indicated on the figures. As shown in the 
figures, the TMl-»2 linking sequence is predicted to reside on the 
opposite side of the membrane to that of the N-terminus and TM6-»7 
loop, and may be important in transmembrane communication. This 

30 is supported by the PSl YllSH mutation which was observed in a 
pedigree with early onset familial AD (30-40 years) and by 
additional mutations in the TMl/2 helices which might be expected 
to destabilize the loop. The TMl-»2 loop is relatively short 
(PSl: residues 101-132; PS2: residues 107-134) making these 

35 sequence more amenable to conventional peptide synthesis. Seven 
PSl mutations cluster in the region between about codon 62 and 
eodon 146, which comprises the putative first transmembrane 
domain (TMl) , the TMl-»2 loop, and the TM2 domain in PSl. 
Similarly, a mutation at codon 141 of PS2 is also located in the 

40 TM2 domain. These mutations proboibly destabilize the TMl-»2 loop 
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domain and its anchor points in TMX and TM2. Twelve PSl 
mutations result in the alteration of amino acids between about 
codons 246 and 410, which are involved in the TM6, TM6-»7 loop, 
and TM7 domains. These mutations may modify the structure or 
5 stability of the TM6-»7 loop (either directly or by modifying the 
conformation of TM6 or TM7) . 

Further evidence for an important functional role residing 
in the TM6-»7 loop is the sequence divergence in the central part 
of the TM6-»7 loop (approximately amino acids 3 00 to 371) among 

10 different members of the presenilin protein family. Similarly, 
because the N- terminus secjuences of members of the presenilin 
protein family are also divergent, it is likely that the slightly 
divergent sequences play a role in conferring specificity to the 
function of each of the different presenilin proteins while the 

15 conserved sequences confer the common biologic activities. These 
regions may represent ligand binding sites. If this is so, 
mutations in the TM6-»7 region are likely to modify ligand binding 
activity. The TMl-»2 loop, which is conserved amongst different 
members of the presenilin protein family, probably represents an 

20 effector domain on the opposing membrane face, with the 

exception of the Exon 10 splicing mutation, most of the other 
(missense) mutations align on the same surfaces of putative 
transmembrane helices, which suggests that they may affect ligand 
binding or channel functions. Thus, these domains (e.g., TM6-*7 

25 and TMl-»2 loops) can be used as sites to develop specific binding 
agents to inhibit the effects of the mutations and/or restore the 
normal function of the presenilin protein in subjects with 
Alzheimer's Disease. 

The similarity between the putative products of the Qj_ 

30 eleaans SPE-4 and the PSl genes implies that they may have 

similar activities. The SPE-4 protein appears to be involved in 
the formation and stabilization of the fibrous body-membrane 
organelle (FBMO) complex during spermatogenesis. The FBMO is a 
specialized Golgi -derived organelle, consisting of a membrane 

35 bound vesicle attached to and partly surrounding a con^lex of 

parallel protein fibers and may be involved in the trauisport and 
storage of soluble and membrsme -bound polypeptides. Mutations in 
SPS-4 disrupt the FBMO complexes and arrest spermatogenesis. 
Therefore the physiologic function of SPE-4 may be either to 

40 stabilize interactions between integral membrane budding and 
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fusion events, or to stabili2e interactions between the membrane 
and fibrillary proteins during the intracellular transport of the 
FBMO complex during spermatogenesis. Comparable functions could 
be envisaged for the presenilins. For example, PSl could be 
5 involved either in the docking of other membrane -bound proteins 
such as MPP/ or the axonal transport and fusion budding of 
membrane -bound vesicles during protein transport, such as in the 
Golgi apparatus or endosome-lysosorae system. If these hypotheses 
are correct, then mutations might be expected to result in 
10 aberrant transport and processing of /5APP and/or abnormal 

interactions with cytoskeletal proteins such as the microtubule - 
associated protein Tau. Abnormalities in the intracellular and 
in the extracellular disposition of both PAPP and Tau are in fact 
an integral part of the neuropathologic features of Alzheimer's 
15 Disease. Although the location of the PSl and PS2 mutations in 
highly conserved residues within conserved domains of the 
putative proteins suggests that they are pathogenic, at least 
three of these mutations are themselves conservative, which is 
commensurate with the onset of disease in adult life. Because 
20 none of the mutations observed so far are deletions or nonsense 
mutations that would be expected to cause a complete loss of 
expression or function, we cannot predict whether these mutations 
will have a dominant gain-of -function effect, thus promoting 
aberrant processing of /?APP or a dominant loss-of -function effect 
25 causing arrest of normal iSAPP processing. The Exon, 10 splicing 
mutation causes an in-frame fusion of Exon 9 to Exon 10, and may 
have a structural effect on the PSl protein which could alter 
intracellular targeting or ligand binding, or may otherwise 
affect PSl function. 
30 An alternative possibility is that the PSl gene product may 

represent a receptor or channel protein. Mutations of such 
proteins have been causally related to several other dominant 
neurological disorders in both vertebrate (e.g., malignant 
hyperthermia, hyperkalemic periodic paralysis in humans) and in 
35 invertebrate organisms (deg-i(d) mutants in C. eleaans ) . 

Although the pathology of these other disorders does not resemble 
that of Alzheimer's Disease, there is evidence for functional 
abnormalities in ion channels in Alzheimer's Disease. For 
example, anomalies have been reported in the tetra-ethylammonium- 
40 sensitive 113pS potassium channel and in calcium homeostasis. 
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Perturbations in transmembrane calcium fluxes might be especially 
relevant in view of the weak homology between PSl and the a- ID 
s\ibunit of voltage -dependent calcium channels and the observation 
that increases in intracellular calcium in cultured cells can 
5 replicate some of the biochemical features of Alzheimer's 
Disease, such as alteration in the phosphorylation of Tau- 
microtubule-associated protein and increased production of 
peptides. 

B. hp?; $tir^gtur^ 

10 As shown in SEQ ID NO: 2, the largest )cnown form of the 

human PSl protein comprises 467 amino acids and has a predicted 
molecular mass of approximately 51.37 kDa. A variant with the 
above -described alternative splicing of Exon 4 (in which the 
residues corresponding to positions 26-29 of SEQ ID NO: 2 are 

15 deleted) would include 4 fewer amino acids and have a mass of 
approximately 50.93 kDa. Similarly, a variant with the above- 
described alternative splicing of Exon 9 (in which the residues 
corresponding to positions 258-290 of SEQ ID NO: 2 are deleted) 
would include 33 fewer amino acids and would have a molecular 

20 mass of approximately 47.74 kDa. 

The positions of the putative domains are presented in Table 2. 
Note again that the nximbering of the residue positions is with 
respect to SEQ ID NO: 2 and is approximate (i.e. ± 2 residues). 
A schematic drawing of the putative PSl structure is shown 

25 in Fig. 5. The N-terminus is a highly hydrophilic, negatively 
charged domain with several potential phosphorylation domains, 
followed sequentially by a hydrophobic membrane spanning domain 
of approximately 19 residues (TMl) , a charged hydrophilic loop of 
approximately 32 residues (TMl-»2), five additional hydrophobic 

30 membraoie spanning domains (TM2 through TM6) interspersed with 
short (1-15 residue) hydrophilic domains (TM2-*3 through TM5-»6), 
an additional larger, acidic hydrophilic charged loop (TM6-*7} and 
at least one (TM7) , and possibly two, other hydrophobic 
potentially membraaie- spanning domains, culminating in a polar 

35 domain at the C-terminus. 

The protein also contains a number of potential 
phosphorylation sites, one of which is a MAP kinase consensus 
site which is also involved in the hyperphosphorylation of Tau 
during the conversion of normal Tau to neurofibrillary tangles. 

40 This consensus sequence may provide a putative element linking 
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this protein's activity to other biochemical aspects of 
Alzheimer's Disease, and would represent a likely therapeutic 
target. Review of the protein structure reveals two sequences 
YTPF (residues 115-118, SEQ ID NO: 2) and STPE (residues 353-356, 
5 SEQ ID NO: 2) which represent the 5/T-P motif which is the MAP 
kinase consensus sequence. Several other phosphorylation sites 
exist with consensus sequences for Protein Kinase C (PRC) 
activity. Because PKC activity is associated with differences in 
the metabolism of APP which are relevant to Alzheimer's Disease, 

10 these sites on the PSl protein and its homologues are also sites 
for targeting therapeutics. Preliminary evidence indicates that, 
at least in transfected cells, the PSl protein is phosphorylated 
only to a minor degree while the PS2 protein is significantly 
phosphorylated. For PS2 at least, it appears that this 

IS phosphorylation occurs on serine residues in the N- terminal 

domain by a mechanism which does not involve PKC (Capell et al., 
1996) . 

Note that the alternative splicing at the end of Exon 4 
removes four amino-acids from the hydrophilic N-terminal domain, 

20 and would be expected to remove a phosphorylation consensus 
sequence. In addition, the alternative splicing of Exon 9 
results in a truncated isoform of the PSl protein wherein the C- 
terminal five hydrophobic residues of TM6 and part of the 
hydrophilic negatively-charged TM6-»7 loop immediately C- terminal 

25 to TM6 is absent. This alternatively spliced isoform is 

characterized by preservation of the sequence from the N- terminus 
up to and including the tyrosine at position 256 of SEQ ID NO: 2, 
changing of the aspartate at position 257 to alanine, and 
splicing to the C- terminal part of the protein from and including 

30 tyrosine 291. Such splicing differences are often associated 
with important functional domains of the proteins. This argues 
that this hydrophilic loop (and consequently the N-terminal 
hydrophilic loop with similar amino acid charge) is/are active 
fxinctional domains of the PSl product and thus sites for 

35 therapeutic targeting. 
C. Human PS2 Structure 

The human PSl and PS2 proteins show 63% over- all amino acid 
identity and several domains display virtually complete identity. 
As would be expected, therefore, hydrophobicity analyses suggest 

40 that both proteins also share a similar structural organization. 
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Thus, both proteins are predicted to possess seven hydrophobic 
putative transmembrane domains, and both proteins bear large 
acidic hydrophilic domains at the N-terminus and between TM6 and 
TM7. A further similarity was apparent from the above-described 
5 analysis of RT-PCR products from brain and muscle RNA, which 
revealed that nucleotides 1153-1250 of the PS2 transcript are 
alternatively spliced- These nucleotides encode amino acids 263- 
296, which are located within the TM6-»7 loop domain of the 
putative PS2 protein and which share 94% sequence identity with 

10 the alternatively spliced amino acids 257-290 in PSl. 

The positions of the putative fiinctional domains of the hPS2 
protein are described in Table 3. Mote that residue positions 
refer to the residue positions of SEQ ID NO: 19, and that the 
positions are approximate (i.e., ± 2 residues). 

15 A schematic drawing of the putative PS2 structure is shown 

in Fig. 6, The similarity between hPSl and hPS2 is greatest in 
several domains of the protein corresponding to the intervals 
between TWl and TM6, emd from TM7 to the C- terminus of the PSl 
protein. The major differences between PSl and PS2 are in the 

20 size and amino acid sequences of the negatively- charged 

hydrophilic TM6-*7 loops, and in the sequences of the N-terminal 
hydrophilic domains. 

The most noticeable differences between the two predicted 
amino acid sequences occur in the amino acid sequence in the 

25 central portion of the TM6-*7 hydrophilic loop {residues 304-374 
of hPSl; 310-355 of hPS2) , and in the N-terminal hydrophilic 
domain. By analogy, this domain is also less highly conserved 
between the murine and human PSl genes (identity » 47/60 
residues) , and shows no similarity to the ecjuivalent region of 

30 SPE-4. 

7. Py^g^nj^l^ii^ Myit^afftg 
A. PSl Mutants 

Several mutations in the PSl gene have been identified which 
cause- a severe type of familial Alzheimer's Disease. One or a 

35 combination of these mutations may be responsible for this form 
of Alzheimer's Disease as well as several other neurological 
disorders. The mutations may be any form of nucleotide sequence 
substitution, insertion or deletion that leads to a change in 
predicted amino acid sequence or that leads to aberrant 

40 transcript processing, level or stability. Specific disease 
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causing mutations in the form of nucleotide and/or amino acid 
deletions or substitutions are described below but it is 
anticipated that additional mutations . will be found in other 
families. Indeed, after the initial discovery of five different 
5 missense mutations amongst eight different pedigrees (Sherrington 
et al. 1395), it was expected from experience with other 
inherited disease (e.g.. Amyotrophic lateral sclerosis associated 
with mutations in the Ca^* superoxide dismutase gene) that 
additional mutations would be identified. This expectation has 

10 been fulfilled by our subsequent discovery of additional 

mutations in the presenilins (Rogaev et al., 1995) and by similar 
observations by others (e.g., Cruts et al., 1995; Campion et al., 
1995). Thus, as used herein with respect to PSl genes and 
proteins, the term 6mutant6 is not restricted to these particular 

15 mutations but, rather, is to be construed as defined above. 

Direct sequencing of overlapping RT-PCR products spanning 
the 2.8 kb S182 transcript isolated from affected members of the 
six large pedigrees linked to chromosome 14 led initially to the 
discovery of five missense mutations in each of the six 

20 pedigrees. Each of these mutations co-segregated with the 

disease in the respective pedigrees, and were absent from upwards 
of 142 unrelated neurological ly normal subjects drawn from the 
same ethnic origins as the FAD pedigrees (284 unrelated 
chromosomes) . The location of the gene within the physical 

25 interval segregating with AD3 trait, the presence of eight 

different missense mutations which co-segregate with the disease 
trait in six pedigrees definitively linked to chromosome 14, and 
the absence of these mutations in 284 independent normal 
chromosomes cumulatively confirmed that the PSl gene is the AD3 

30 locus. Further biological support for this hypothesis arises 
from the facts that the residues mutated in FAD kindreds are 
conserved in evolution (e.g., hPSl v. mPSl) , that the mutations 
are located in domains of the protein which are also highly 
conserved in other vertebrate auid invertebrate homologues, auid 

35 ^that the PSl gene product is expressed at high levels in most 
regions of the brain, including those most severely affected by 
AD. 

Since the original discovery of the PSl gene, many 
additional mutations associated with the development of AD have 
40 been catalogued. Table 4 characterizes a number of these. Each 
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of the observed nucleotide deletions or substitutions occurred 
within the putative ORF of the PSl transcript, and would be 
predicted to change the encoded amino acid at the positions 
shown. The mutations are listed with reference to their 
5 nucleotide locations in SEQ ID NO: 1 and with reference to their 
amino add positions in SEQ ID NO: 2. An entry of "NA" indicates 
that the data was not available. 

As discussed in the next section, a number of PS2 mutations 
have also been found. A comparison of the hPSl and hPS2 

10 sequences is shown in Figure 4 and reveals that these pathogenic 
mutations are in regions of the PS2 protein which are conserved 
in the PSl protein. Therefore, corresponding mutations in the 
PSl protein may also be expected to be pathogenic and are 
included in the PSl mutants provided and enabled herein, 

15 Furthermore, any pathogenic mutation identified in any conserved 
region of a presenilin gene may be presumed to represent a mutant 
of the other presenilins which share that conserved region. 

Interestingly, mutations A260V, C263R, P264L, P267S, E2aOA, 
E280G, A285V, L286V, A291-319, G384A, L392V. and C410y all occur 

20 in or near the acidic hydrophilic loop between the putative 
transmembrane domains TM6 and TM7. Eight of these mutations 
(A260V, C263R, P264L, P2S7S, E280A, E280G, A285V, L286V) are also 
located in the alternative splice domain (residues 257-29^ of SEQ 
ID NO: 2} . 

25 All of these mutations can be assayed by a variety of 

strategies (direct nucleotide sequencing, allele specific 
oligonucleotides, ligation polymerase chain reaction, SSCP, 
RFLPs, new "DNA chip" technologies, etc.) using RT-PCR products 
representing the mature mRNA/cDNA sequence or genomic DNA. 

30 Finally, it should be noted that several polymorphisms with 

no apparent deleterious effect have also been discovered. One of 
these, a T-»G change of nucleotide 863 of SEQ ID NO: 1, causes a 
F205L polymorphism in TM4. Others (C-»A at bp 1700; Gr-»A at bp 
2603; deletion of bp 2620) are in the 3d UTR. 

35 B. P?? Mut^c^ntp? 

The strong similarity between PSl and the PS2 gene product 
raised the possibility that the PS2 gene might be the site of 
disease- causing mutations in some of a small number of early 
onset AD pedigrees in which genetic linkage studies have excluded 

40 chromosomes 14, 19 and 21. RT-PCR was used to isolate cDNAs 
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corresponding to the PS2 transcript from lymphoblasts, 
fibroblasts or post-mortem brain tissue of affected members of 
eight pedigrees with early onset FAD in which mutations in the 
0APP and PSl genes had previously been excluded by direct 
5 sequencing studies. 

Examination of these RT-PCR products detected a heterozygous 
substitution at nucleotide 1080 in all four affected members 
of an extended pedigree of Italian origin (FlolO) with early 
onset, pathologically confirmed FAD (onset 50-70 yrs) . This 
10 mutation would be predicted to cause a Met-»Val missense mutation 
at codon 239 in TM5. 

A second mutation (A-#T at nucleotide 787) causing a Asn-^Ile 
substitution at codon 141 in TM2 was found in affected members of 
a group of related pedigrees of Volga German ancestry 
15 (represented by cell lines AG093 69, AG09907, AG09952, and 
AG09905, Coriell Institute, Camden NJ) . Significantly, one 
subject {AG09907) was homozygous for this mutation, an 
observation compatible with the inbred nature of these pedigrees. 
Significantly, this subject did not have a significantly 
20 different clinical picture from those sxibjects. heterozygous for 

the N14H mutation. Neither of the PS2 gene mutations were found 
in 284 normal Caucasian controls nor were they present in 
affected members of pedigrees with the AD3 type of AD. 

Both of these PS2 mutations would be predicted to cause 
25 substitution of residues which are highly conserved within the 
PS1/PS2 gene family. 

An additional PS2 mutation is caused by a T-#C siibstitution 
at base pair 1624 causing an He to Thr substitution at codon 420 
of the C- terminus. This mutation was found in an additional case 
30 of early onset (45 yrs) familial AD. 

These hPS2 mutations are listed in Table 5 with reference to 
their nucleotide locations in SEQ ID NO: is and with reference to 
their amino acid positions in SEQ ID NO: 19. An entry of "NA" in 
the table indicates that the data was not available. 
35 As discussed in the previous section, a number of PSl 

mutations have also been foixnd. A comparison of the hPSl and 
hPS2 sequences is shown in Figure 4 and reveals that these 
pathogenic mutations are in regions of the PSl protein which are 
largely conserved in the PS2 protein. Therefore, corresponding 
40 mutations in the PS2 protein may also be expected to be 
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pathogenic and are included in the PS2 mutants provided and 
enaibled herein. Furthermore, any pathogenic mutation identified 
in any conserved region of a presenilin gene may be presumed to 
represent a mutant of the other presenilins which share that 
5 conserved region. 

The finding of a gene whose product is predicted to share 
stabstantial amino acid and structural similarities with the PSl 
gene product suggests that these proteins may be functionally 
related as independent proteins with overlapping functions but 
10 perhaps with slightly different specific activities, as 

physically associated subunits of a multimeric polypeptide or as 
independent proteins performing consecutive, functions in the same 
pathway. 

The observation of three different missense mutations in 

15 conserved domains of the PS2 protein in subjects with a familial 
form of AD argues that these mutations are, like those in the PSl 
gene, causal to AD. This conclusion is significant because, 
while the disease phenotype associated with mutations in the PSl 
gene (onset 30-50 yrs, duration 10 yrs) is subtly different from 

20 that associated with mutations in the PS2 gene (onset 40-70 yrs; 
duration up to 20 yrs) , the general similarities clearly argue 
that the biochemical pathway subsumed by members of this gene 
family is central to the genesis of at least early onset AD. The 
subtle differences in disease phenotype may reflect a lower level 

25 of expression of the PS2 transcript in the CNS, or may reflect a 
different role for the PS2 gene product. 

By analogy to the effects of PSl mutations, PS2 when mutated 
may cause aberrant processing of APP (Amyloid Precursor Protein) 
into A0 peptide, hyperphosphorylation of Tau microtubule 

30 associated protein and abnormalities of intracellular calcium 
homeostasis. Interference with these anomalous interactions 
provides for therapeutic intervention in AD. 

Finally, at least one nucleotide polymorphism has been found 
in one normal individual whose PS2 cDNA had a T-^C change at bp 

35 62 6 of SEQ ID NO: 18, without any change in the encoded amino 
acid sequence. 
III. Pref^yr^ta BnP?Q44in^n\ts 

Based, in part, upon the discoveries disclosed and described 
herein, the following preferred embodiments of the present 

40 invention are provided. 
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1. Isolated Nucleic Acids 

In one series of embodiments, the present invention provides 
isolated nucleic acids corresponding to, or relating to, the 
presenilin nucleic acid sequences disclosed herein. As described 
5 more fully below, these sequences include normal PSl and PS2 

sequences from humans and other mammalian species, mutant PSl and 
PS2 sequences from humans and other mammalian species, homologous 
sequences from non-mammalian species such as Drosophila and C. 
eleaans. subsets of these sequences useful as probes and PGR 

10 primers, siibsets of these sequences encoding fragments of the 
presenilin proteins or corresponding to particular structural 
domains or polymorphic regions, complementary or antisense 
sequences corresponding to fragments of the presenilin genes, 
sequences in which the presenilin coding regions have been 

15 operably joined to exogenous regulatory regions, and sequences 
encoding fusion proteins of the portions of the presenilin 
proteins fused to other proteins useful as markers of expression, 
as "tags" for purification, or in screens and assays for proteins 
interacting with the presenilins. 

20 Thus, in a first series of embodiments, isolated nucleic 

acid sequences are provided which encode normal or mutant 
versions of the PSl and PS2 proteins. Examples of such nucleic 
acid sequences are disclosed herein. These nucleic acids may be 
genomic sequences (e.g., SEQ ID NOs: 5-15) or may be cDNA 

25 sequences (e.g., SEQ ID NOs: 1, 3, 16, and 18). In addition, the 
nucleic acids may be recombinant genes or "minigenes" in which 
all or some of the introns Various combinations of the introns 
and exons and -local acting- regulatory elements may be 
engineered in propagation or expression constructs or vectors. 

30 Thus, for example, the invention provides nucleic acid sequences 
in which the alternative splicing variations described herein are 
incorporated at the DNA level, thus enabling cells including 
these seG[uences to express only one of the alternative splice 
variants at each splice position. As am example, a recombinant 

35 gene may be produced in which the 3' end of Exon 1 of the PSl 

gene (bp 1337 of SEQ ID NO: 5) has been joined directly to the 5' 
end of Exon 3 (bp 588 of SEQ ID NO: 6) so that only transcripts 
corresponding to the predominant transcript are produced. 
Obviously, one also may create a recombinant gene dforcingfi the 

40 alternative splice of Exon 2 and Exon 3. Similarly, a 
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recombinant gene may be produced in which one of the Exon 4 or 
Exon 9 splice variants of PSl (or the corresponding TM6-»7 splice 
variant of PS2) is incorporated into DNA such that cells 
including this recombinant gene Ccui express only one of these 
5 variants. For purposes of reducing the size of a recombinant 
presenilin gene, a cDNA gene may be employed or various 
combinations of the introns and untranslated exons may be removed 
from a DNA construct. Finally, recombinant genes may be produced 
in which the 5' UTR is altered such that transcription proceeds 

10 necessarily from one or the other of the two transcription 

initiation sites. Such constructs may be particularly useful, as 
described below, in identifying compounds which can induce or 
repress the expression of the presenilins. Many variations on 
these embodiments are now enabled by the detailed description of 

15 the presenilin genes provided herein. 

In addition to the disclosed presenilin sequences, one of 
ordinary skill in the art is now enabled to identify and isolate 
nucleic acids representing presenilin genes or cDNAs which are 
allelic to the disclosed sequences or which are heterospecif ic 

20 homologues. Thus, the present invention provides isolated 

nucleic acids corresponding to these alleles and homologues, as 
well, as the various above-described recombinant constructs 
derived from these sequences, by means which are well known in 
the art. Briefly, one of ordinary skill in the art may now 

25 screen preparations of genomic or cDNA, including samples 

prepared from individual orgemisms (e.g., human AD patients or 
their family members) as well as bacterial, viral, yeast or other 
libraries of genomic or" cDNA, " using probes or PGR primers to 
identify allelic or homologous sequences. Because it is 

30 desirable to identify additional presenilin gene mutations which 
may contribute to the development of AD or other disorders, 
because it is desirable to identify additional presenilin 
polymorphisms which are not pathogenic, and because it is also 
desired to create a variety of animal models which may be used to 

35 study AD and screen for potential therapeutics, it is 

particularly contemplated that additional presenilin sequences 
will be isolated from other preparations or libraries of human 
nucleic acids and from preparations or libraries from animals 
including rats, mice, hamsters, guinea pigs, rabbits, dogs, cats, 

40 goats, sheep, pigs, and non-hxoman primates. Furthermore, 
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presenilin homologues from yeast or invertebrate species, 
including C. eleaans and other nematodes, as well as Drosophila 
and other insects, may have particular utility for drug 
screening. For example, invertebrates bearing mutant presenilin 
5 homologues (or mammaliaui presenilin transgenes) which cause a 
rapidly occurring and easily scored phenotype (e.g., abnormal 
vulva or eye development after several days) can be used as 
screens for drugs which block the effect of the mutant gene. 
Such invertebrates may prove far more rapid and efficient for 
10 mass screenings than larger vertebrate animals. Once lead 

compounds are found through such screens, they may be tested in 
higher animals. 

Standard hybridization screening or PGR techniques may be 
employed {as used, for example, in the identification of the mPSl 

IS gene) to identify and/or isolate such allelic and homologous 

sequences using relatively short presenilin gene sequences. The 
sequences may include 8 or fewer nucleotides depending upon the 
nature of the target sequences, the method employed, and the 
specificity required. Future technological developments may 

20 allow the advantageous use of even shorter sequences. With 

current technology, sequences of 9-50 nucleotides, and preferably 
about 18-24 are preferred- These sequences may be chosen from 
those disclosed herein, or may be derived from other allelic or 
heterospecif ic homologues enabled herein. When probing mRNA or 

25 screening cDNA libraries, probes and primers from coding 

sequences (rather than introns) are preferably employed, and 
sequences which are omitted in alternative splice variants 
typically are avoided unless it is specifically desired to 
identify those variants. Allelic variants of the presenilin 

30 genes may be expected to hybridize to the disclosed sequences 
under stringent hybridization conditions, as defined herein, 
whereas lower stringency may be employed to identify 
heterospecif ic homologues. 

In another series of embodiments, the present invention 

35 provides for isolated nucleic acids which include subsets of the 
presenilin sequences or their complements. As noted above, such 
sequences will have utility as probes and PGR primers in the 
identification and isolation of allelic and homologous variants 
of the presenilin genes. Subsequences corresponding to the 

40 polymorphic regions of the presenilins, as described above, will 
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also have particular utility in screening and/or genotyping 
individuals for diagnostic purposes, as described below. In 
addition, and also as described below, such subsets will have 
utility for encoding (i) fragments of the presenilin proteins for 
5 inclusion in fusion proteins, (2) fragments which comprise 

functional domains of the presenilin proteins for use in binding 
studies, (3) fragments of the presenilin proteins which may be 
used as immunogens to raise antibodies against the presenilin 
proteins, and (4) fragments of the presenilins which may act as 

10 competitive inhibitors or as mimetics of the presenilins to 

inhibit or mimic their physiological fxinctions. Finally, such 
subsets may encode or represent complementary or antisense 
sequences which can hybridize to the presenilin genes or 
presenilin mRNA. transcripts under physiological conditions to 

15 inhibit the transcription or translation of those sequences. 

Therefore, depending upon the intended use, the present invention 
provides nucleic acid stibsequences of the presenilin genes which 
may have lengths varying from 8-10 nucleotides {e.g., for use as 
PGR primers) to nearly the full size of the presenilin genomic or 

20 cDNAs- Thus, the present invention provides isolated nucleic 
acids comprising sequences corresponding to at least 8-10, 
preferably 15, and more preferably at least 20 consecutive 
nucleotides of the presenilin genes, as disclosed or otherwise 
enabled herein, or to their complements. As noted sQ>ove, 

25 however, shorter sequences may be useful with different 
technologies . 

In another series of embodiments, the present invention 
provides nucleic acids in which the presenilin coding sequences, 
with or without introns or recombinantly engineered as described 

30 above, are operably joined to endogenous or exogenous 5' auid/or 
3 ' regulatory regions . The endogenous regulatory regions of the 
hPSl gene are described and disclosed in detail herein. Using 
the present disclosure and standard genetic techniques (e.g., PGR 
extensions, targeting gene walking) , one of ordinary skill in the 

35 art is also now enabled to clone the corresponding hPS2 5' and/or 
3' endogenous regulatory regions. Similarly, allelic variants of 
the hPSl and hPS 2 endogenous regulatory regions, as wells as 
endogenous regulatory regions from other mammalian homologues, 
are similarly enabled without undue experimentation. 

40 Alternatively, exogenous regulatory regions (i.e., regulatory 
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regions from a different conspecific gene or a heterospecif ic 
regulatory region) may be opercd>ly joined to the presenilin 
coding sequences in order to drive expression. Appropriate 5' 
regulatory regions will include promoter elements and may also 
5 include additional elements such as operator or enhancer 

sequences, ribosome binding sequences, RNA capping sequences, and 
the like. The regulatory region may be selected from sequences 
that control the expression of genes of prokaryotic or eukaryotic 
cells, their viruses, and combinations thereof. Such regulatory 

10 regions include, but are not limited to, the lac system, the trp 
system, the tac system, and the trc system; major operator and 
promoter regions of phage X; the control region of the fd coat 
protein; early and late promoters of SV40; promoters derived from 
polyoma, adenovirus, retrovirus, baculovirus, and simiaji virus; 

15 3-phosphoglycerate kinase promoter; yeast acid phosphatase 

promoters; yeast alpha-mating factors; promoter elements of other 
eukaryotic genes expressed in neurons or other cell types; and 
combinations thereof. In particular, regulatory elements may be 
chosen which are inducible or repressible (e.g., the fi- 

20 galactosidase promoter) to allow for controlled and/or 
manipulable expression of the presenilin genes in cells 
transformed with these nucleic acids. Alternatively, the 
presenilin coding regions may be operably joined with regulatory 
elements which provide for tissue specific expression in 

25 multicellular organisms. Such constructs are particularly useful 
for the production of transgenic organisms to cause expression of 
the presenilin genes only in appropriate tissues. The choice of 
appropriate regulatory regions is within the ability and 
discretion of one of ordinary skill in the art and the 

30 recombincmt use of many such regulatory regions is now 
established in the art. 

In atnother series of embodiments, the present invention 
provides for isolated nucleic acids encoding all or a portion of 
the presenilin proteins in the form of a fusion protein. In 

35 these embodiments, a nucleic acid regulatory region (endogenous 
or exogenous) is operably joined to a first coding region which 
is covalently joined in- frame to a second coding region. The 
second coding region optionally may be covalently joined to one 
or more additional coding regions and the last coding region is 

40 joined to a termination codon and, optionally, appropriate 3' 
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regulatory regions (e.g., polyadenylation signals). The 
presenilin sequences of the fusion protein may represent the 
first, second, or any additional coding regions. The presenilin 
sec[uences may be conserved or non- conserved domains and can be 
5 placed in any coding region of the fusion. The non-presenilin 
sequences of the fusion may be chosen according to the needs and 
discretion of the practitioner and are not limited by the present 
invention. Useful non-presenilin sequences include, however, 
short sequence "tags" such as antigenic determinants or poly-His 

10 tags which may be used to aid in the identification or 

purification of the resultant fusion protein. Alternatively, the 
non-presenilin coding region may encode a large protein or 
protein fragment, such as an enzyme or binding protein which also 
may assist in the identification and purification of the protein, 

15 or which may be useful in an assay such as those described below. 
Particularly contemplated presenilin fusion proteins include 
poly-His and GST (glutathione S- transferase) fusions which are 
useful in isolating and purifying the presenilins, and the yeast 
two hybrid fusions, described below, which are useful in assays 

20 to identify other proteins which bind to or interact with the 
presenilins. 

In another series of embodiments, the present invention 
provides isolated nucleic acids in the form of recombinant DNA 
constructs in which a marker or reporter gene (e.g., /3- 

25 galactosidase, lucif erase) is operably joined to the 5' 

regulatory region of a presenilin gene such that expression of 
the marker gene is under the control of the presenilin regulatory 
sequences. Using the presenilin regulatory regions disclosed or 
otherwise enabled herein, including regulatory regions from PSl 

30 and PS2 genes from hiiman and other mammalian species, one of 
ordinary skill in the art is now enabled to produce such 
constructs. As discussed more fully below, such isolated nucleic 
acids may be used to produce cells, cell lines or tremsgenic 
animals which are useful in the identification of compounds which 

35 can, directly or indirectly, differentially affect the expression 
of the presenilins. 

Finally, the isolated nucleic acids of the present invention 
include any of the above described sequences when included in 
vectors. Appropriate vectors include cloning vectors and 

40 expression vectors of all types, including plasmids, phagemids. 
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cosmids, episomes, and the like, as well as integration vectors. 
The vectors may also include various marker genes (e.g., 
antibiotic resistance or susceptibility genes) which are useful 
in identifying cells successfully transformed therewith. In 
5 addition, the vectors may include regulatory sequences to which 
the nucleic acids of the invention are operably joined, and/or 
may also include coding regions such that the nucleic acids of 
the invention, when appropriately ligated into the vector, are 
expressed as fusion proteins. Such vectors may also include 

10 vectors for use in yeast "two hybrid," baculovirus, and phage- 
display systems. The vectors may be chosen to be useful for 
prokaryotic, eukaryotic or viral e3q>ression, as needed or desired 
for the particular application. For example, vaccinia virus 
vectors or simian virus vectors with the SV40 promoter (e.g., 

15 pSV2) , or Herpes simplex virus or adeno-associated virus may be 
useful for transfection of mammalian cells including neurons in 
culture or in vivo , and the baculovirus vectors may be used in 
transfecting insect cells (e.g., butterfly cells). A great 
variety of different vectors are now commercially available and 

20 otherwise known in the art, and the choice of an appropriate 
vector is within the ability and discretion of one of ordinary 
skill in the art. 
2. Subatantiallv Pure Proteins 

The present invention provides for substantially pure 

25 preparations of the presenilin proteins, fragments of the 
presenilin proteins, and fusion proteins including the 
presenilins or fragments thereof. The proteins, fragments and 
fusions have utility, as described herein, in the generation of 
antibodies to normal and mutant presenilins, in the 

30 identification of presenilin binding proteins, and in diagnostic 
and therapeutic methods. Therefore, depending upon the intended 
use, the present invention provides substantially pure proteins 
or peptides comprising amino acid sequences which are 
subsequences of the complete presenilin proteins and which may 

35 have lengths varying from 4-10 amino acids (e.g., for use as 
immxmogens) , or 10-100 amino acids (e.g., for use in binding 
assays), to the complete presenilin proteins. Thus, the present 
invention provides substantially pure proteins or peptides 
comprising sequences corresponding to at least 4-5, preferably 6- 

40 10, and more preferably at least 50 or 100 consecutive amino 
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acids of the presenilin proteins, as disclosed or otherwise 
enabled herein. 

The proteins or peptides of the invention may be isolated 
and purified by any of a variety of methods selected on the basis 
5 of the properties revealed by their protein sequences. Because 
the presenilins possess properties of integral or membrane- 
spanning proteins, a membrane fraction of cells in which the 
presenilin is normally highly expressed (e.g., ne\irons, 
oligodendroglia, muscle, pancreas) may be isolated and the 

10 proteins extracted by, for example, detergent solubilization. 

Alternatively the presenilin protein, fusion protein, or fragment 
thereof, may be purified from cells transformed or transfected 
with expression vectors (e.g., baculovirus systems such as the 
pPbac and pMbac vectors (Stratagene, La Jolla, CA) ; yeast 

15 expression systems such as the pYESHIS Xpress vectors 

(Invitrogen, San Diego, CA) ; eukaryotic expression systems such 
as PCDNA3 (Invitrogen, San Diego, CA) which has constant 
constitutive expression, or LacSwitch (Stratagene, La Jolla, CA) 
which is inducible; or prokaryotic expression vectors such as 

20 PKK233-3 (Clontech, Palo Alto, CA) . In the event that the 

protein or fragment integrates into the endoplasmic reticulum or 
plasma membrane of the recombinant cells (e.g., immortalized 
human cell lines or other eukaryotic cells) , the protein may be 
purified from the membrane fraction. Alternatively, if the 

25 protein is not properly localized or aggregates in inclusion 
bodies within the recombinant cells (e.g., prokaryotic cells), 
the protein may be purified from whole lysed cells or from 
solubilized inclusion bodies.' 

Purification can be achieved using stsuadard protein 

30 purification procedures including, but not limited to, gel- 
filtration chromatography, ion-exchange chromatography, high- 
performance liquid chromatography (RP-HPLC, ion-exchange HPLC, 
size-exclusion HPLC, high-performance chromatofocusing 
chromatography, hydrophobic interaction chromatography, 

35 imraunoprecipitation, or immunoaf finity purification. Gel 
electrophoresis (e.g., PAGE, SDS-PAGE) can also be used to 
isolate a protein or peptide based on its molecular weight, 
charge properties and hydrophobicity. 

A presenilin protein, or a fragment thereof, may also be 

40 conveniently piirified by creating a fusion protein including the 
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desired presenilin sequence fused to another peptide such as an 
antigenic determinant or poly-His tag (e.g., QIAexpress vectors, 
QIAGEN Corp., Chatsworth, CA) , or a larger protein (e.g., GST 
using the pGEX-27 vector (Amrad, USA) or green fluorescent 
5 protein using the Green Lantern vector (GIBCO/BRL. Gaithersburg, 
MD) . The fusion protein may be expressed and recovered from 
prokaryotic or eukaryotic cells and purified by any standard 
method based upon the fusion vector sequence. For example, the 
fusion protein may be purified by immunoaf f inity or 

10 immunoprecipitation with an antibody to the non-presenilin 
portion of the fusion or, in the case of a poly-His tag, by 
affinity binding to a nickel column. The desired presenilin 
protein or fragment can then be further purified from the fusion 
protein by enzymatic cleavage of the fusion protein. Methods for 

15 preparing and using such fusion constructs for the purification 
of proteins are well known in the art and several kits are now 
commercially available for this purpose. In light of the present 
disclosure, one is now enabled to enqploy such fusion constructs 
with the presenilins. 

20 3. Antibodies to the Presenilins 

The present invention also provides antibodies, and methods 
of making antibodies, which selectively bind to the presenilin 
proteins or fragments thereof. Of particular in^ortance, by 
identifying the f\inctional domains of the presenilins and the 

25 polymorphic regions associated with AD, the present invention 

provides antibodies, and methods of making antibodies, which will 
selectively bind to and, thereby, identify and/or distinguish 
normal and mutant (i.e., pathogenic) forms of the presenilin 
proteins. The antibodies of the invention have utility as 

30 laboratory reagents for, inter alia , immunoaf f inity purification 
of the presenilins. Western blotting to identify cells or tissues 
expressing the presenilins, auid immunocytochemistry or 
immunofluorescence techniques to establish the subcellular 
location of the protein. In addition, as described below, the 

35 antibodies of the invention may be used as diagnostics tools to 
identify carriers of AD-related presenilin alleles, or as 
therapeutic tools to selectively bind and inhibit pathogenic 
forms of the presenilin proteins in vivo . 

The antibodies of the invention may be generated using the 

40 entire presenilin proteins of the invention or using any 
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presenilin epitope which is characteristic of that protein and 
which substantially distinguishes it from other host proteins. 
Such epitopes may be identified by comparing sequences of, for 
example, 4-10 amino acid residues from a presenilin sequence to 
5 coii5)uter databases of protein sequences from the relevant host. 
Preferably, the epitopes are chosen from the N- and C- termini, or 
from the loop domains which connect the transmembrane domains of 
the proteins. In particular, cuitibodies to the polymorphic N- 
terminal region, TMl-^2 loop, or Tm-^l loop are expected to have 

10 the greatest utility both diagnostically and therapeutically. On 
the other hand, antibodies against highly conserved domains are 
expected to have the greatest utility for purification or 
identification of presenilins. 

Using the IBI Pustell program, amino acid residue positions 

15 were identified as potential antigenic sites in the hPSl protein 
and may be useful in generating the antibodies of the invention. 
These positions, corresponding to positions in SEQ ID NO: 2, are 
listed in Table 6. 

other methods of choosing antigenic determinants may, of 

20 course, are known in the art and be employed. In addition, 
larger fragments (e.g., 8-20 or, preferably, 9-15 residues) 
including some of these epitopes may also be employed. For 
example, a fragment including the 109-112 epitope may comprise 
residues 107-114, or 105-116. Even larger fragments, including 

25 for example entire functional domains or multiple function 

domains (e.g., TMl, TMl-»2, and TM2 or TM6, TM6->7, and TM7) may 
also be preferred. For other presenilin proteins (e.g., for mPSl 
or other non-human homologues, or for PS2) , homologous sites may 
be chosen. 

30 Using the same IBI Pustell program, amino acid residue 

positions were identified as potential antigenic sites in the 
hPS2 protein and may be useful in generating the antibodies of 
the invention. These positions, corresponding to positions in 
SEQ ID NO: 19, are listed in Table 7. 

35 As for PSl, other methods of choosing antigenic determinants 

may, of course, are known in the art and be employed. In 
addition, larger fragments (e.g., 8-20 or, preferably, 9*15 
residues) including some of these epitopes may also be employed. 
For example, a fragment including the 310-314 epitope may 

40 comprise residues 308-316, or 307-317. Even larger fragments, 
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including for example entire functional domains or multiple 
function domains (e.g., TMl, TMl-*2, and TM2 or TM6, TM6-^7, and 
TM7) may also be preferred. For other presenilin proteins (e.g., 
for mPS2 or other non-human homologues, or for PSl) , homologous 
5 sites may be chosen. 

Presenilin immunogen preparations may be produced from crude 
extracts (e.g., membrane fractions of cells highly expressing the 
proteins) , from proteins or peptides stibstantially purified from 
cells which naturally or recombinant ly express them or, for short 

10 immunogens, by chemical peptide synthesis. The presenilin 

immunogens may also be in the form of a fusion protein in which 
the non-presenilin region is chosen for its adjuvant properties. 
As used herein, a presenilin immunogen shall be defined as a 
preparation including a peptide comprising at least 4-8, and 

15 preferably at least 9-15 consecutive amino acid residues of the 
presenilin proteins, as disclosed or otherwise enabled herein. 
Sequences of fewer residues may, of course, also have utility 
depending upon the intended use and future technological 
developments. Therefore, any presenilin derived sequences which 

20 are employed to generate antibodies to the presenilins should be 
regarded as presenilin immunogens. 

The antibodies of the invention may be polyclonal or 
monoclonal, or may be antibody fragments, including Fab 
fragments, F{ab')2# and single chain antibody fragments. In 

25 addition, after identifying useful antibodies by the method of 

the invention, recombinant £uitibodies may be generated, including 
cuiy of the antibody fragments listed above, as well as humanized 
antibodies based upon non- human antibodies to the presenilin 
- proteins. In light of the present disclosures of presenilin 

30 proteins, as well as the characterization of other presenilins 
enabled herein, one of ordinary skill in the art may produce the 
above -described antibodies by any of a variety of standard means 
well known in the art. For an overview of antibody techniques, 
see Antibody Engineering: A Practical Guide . Borrebaek, ed., W.H. 

35 Freeman & Company, NY (1992), or Antibody Engineering. 2nd Ed., 
Borrebaek, ed., Oxford University Press, Oxford (1995). 

As a general matter, polyclonal antibodies may be generated 
by first immxmizing a mouse, rabbit, goat or other suitedale 
animal with the presenilin immunogen in a suitable carrier. To 

40 increase the immunogenicity of the preparation, the immunogen may 
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be coupled to a carrier protein or mixed with an adjuvsuit (e.g., 
Freund's adjuvant). Booster injections, although not necessary 
are recommended. After an appropriate period to allow for the 
development of a humoral response, preferably several weeks, the 
5 animals may be bled and the sera may be purified to isolate the 
~ immunoglobulin component . 

Similarly, as a general matter, monoclonal anti-presenilin 
antibodies may be produced by first injecting a mouse, rabbit, 
goat or other suitable animal with a presenilin immunogen in a 

10 suitable carrier. As above, carrier proteins or adjuvants may be 
utilized and booster injections (e.g., bi- or tri -weekly over 8- 
10 weeks) are recommended. After allowing for development of a 
humoral response, the animals are sacrificed and their spleens 
are removed and resuspended in, for example, phosphate buffered 

15 saline (PBS). The spleen cells serve as a source of lymphocytes, 
some of which are producing antibody of the appropriate 
specificity. These cells are then fused with an immortalized 
cell line (e.g., myeloma), and the products of the fusion are 
plated into a number of tissue culture wells in the presence of a 

20 selective agent such as HAT. The wells are serially screened and 
replated, each time selecting cells making useful antibody. 
Typically, several screening and replating procedures are carried 
out until over 90% of the wells contain single clones which are 
positive for antibody production. Monoclonal antibodies produced 

25 by such clones may be purified by standard methods such as 
affinity chromatography using Protein A Sepharose, by ion- 
exchange chromatography, or by variations and combinations of 
these techniques. 

The antibodies of the invention may be labelled or 

30 conjugated with other compounds or materials for diagnostic 

and/or therapeutic uses. For example, they may be coupled to 
radionuclides, fluorescent compounds, or enzymes for imaging or 
therapy, or to liposomes for the targeting of compounds contained 
in the liposomes to a specific tissue location. 

35 4. Transformed Cell Lines 

The present invention also provides for cells or cell lines, 
both prokaryotic and eukaryotic, which have been trsuisformed or 
transfected with the nucleic acids of the present invention so as 
to cause clonal propagation of those tmcleic acids and/or 

40 expression of th proteins or peptides encoded thereby. Such 
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cells or cell lines will have utility both in the propagation and 
production of the nucleic acids and proteins of the present 
invention but also, as further described herein, as model systems 
for diagnostic and therapeutic assays. As used herein, the term 
5 "transformed cell" is intended to embrace any cell, or the 

descendant of any cell, into which has been introduced any of the 
nucleic acids of the invention, whether by transformation, 
transf ection, infection, or other means. Methods of producing 
appropriate vectors, treuisforming cells with those vectors, and 

10 identifying transf ormants are well known in the art and are only 
briefly reviewed here (see, for example, Sambrook et al. (1989) 
Molecular Cloning; A Laboratory Manual. 2nd ed. , Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, New York) . 

Prokaryotic cells useful for producing the transformed cells 

15 of the invention include members of the bacterial genera 

Bgg^?yAchj.^ (e.g., E. cq;A) > Pseudomonas (e.g., P. aeruginosa > . 
and Bacit3^vs (e.g., B. subtillus. B . st earothermonhi lus ) . as well 
as many others well known and frequently used in the art. 
Prokaryotic cells are particularly useful for the production of 

20 large quantities of the proteins or peptides of the invention 
(e.g., normal or mutant presenilins, fragments of the 
presenilins, fusion proteins of the presenilins) . Bacterial 
cells (e.g., E. coli) may be used with a variety of expression 
vector systems including, for example, plasmids with the T7 RNA 

25 polymerase/promoter system, bacteriophage \ regulatory sequences, 
or M13 Phage mGPI-2. Bacterial hosts may also be transformed 
with fusion protein vectors which create, for example, lacZ, 
trpE, maltose-binding protein, poly-His tags, or glutathione -S- 
transferase fusion proteins. All of these, as well as many other 

30 prokaryotic expression systems, are well known in the art and 

widely available commercially (e.g., pGEX-27 (Amrad, USA) for GST 
fusions) . 

Eukaryotic cells and cell lines useful for producing the 
transformed cells of the invention include mammalian cells and 

35 cell lines (e.g., PC12, COS, CHO, fibroblasts, myelomas, 

neuroblastomas, hybridomas, human embryonic kidney 293, oocytes, 
embryonic stem cells), insect cells lines (e.g., using 
baculovirus vectors such as pPbac or pNbac (Stratagene, La Jolla, 
CA) ) , yeast (e.g., using yeast expression vectors such as pYESHIS 

40 (Invitrogen, CA) ) , and fungi. EuJcaryotic cells are particularly 
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useful for embodiments in which it is necessary that the 
presenilin proteins, or functional fragments thereof, perform the 
functions and/or undergo the intracellular interactions 
associated with either the normal or muteuit proteins. Thus, for 
5 example, transformed eukaryotic cells are preferred for use as 
models of presenilin function or interaction, and assays for 
screening candidate therapeutics preferably employ transformed 
exikaryotic cells. 

To accomplish expression in eukaryotic cells, a wide variety 

10 of vectors have been developed and are commercially available 
which allow inducible (e.g., LacSwitch e3cpression vectors, 
Stratagene, La Jolla, CA) or cognate (e.g., pcDNA3 vectors, 
Invitrogen, Chatsworth, CA) expression of presenilin nucleotide 
sequences under the regulation of an artificial promoter element. 

15 Such promoter elements are often derived from CMV or SV4 0 viral 
genes, although other strong promoter elements which are active 
in eukaryotic cells can also be employed to induce transcription 
of presenilin nucleotide sequences. Typically, these vectors 
also contain an artificial polyadenylation sequence and 3' UTR 

20 which can also be derived from exogenous viral gene sequences or 
from other eukaryotic genes- Furthermore, in some constructs, 
artificial, non-coding, spliceable introns and exons are included 
in the vector to enhance expression of the nucleotide sequence of 
interest (in this case, presenilin sequences) . These expression 

25 systems are commonly available from commercial sources and are 
typified by vectors such as pcDNA3 and pZeoSV (Invitrogen, San 
Diego, CA) . Both of the latter vectors have been successfully 
used to cause expression of presenilin proteins in tramsfected 
COS, CHO, and PC12 cells (Levesque et al. 1996). Innumerable 

30 commercially-available as well as custom-designed expression 

vectors are available from commercial sources to allow expression 
of any desired presenilin transcript in more or less any desired 
cell type, either const itutively or after exposure to a certain 
exogenous stimulus (e.g., withdrawal of tetracycline or exposure 

35 to IPTG) . 

Vectors may be introduced into the recipient or "host** cells 
by various methods well known in the art including, but not 
limited to, calcium phosphate transf ection, strontiiun phosphate 
transfection, DEAE dextran transfection, electroporation, 
40 lipofection (e.g., Dosper Liposomal transfection reagent. 
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Boehringer Mannheim, Germany) , microinjection, ballistic 
insertion on micro-beads, protoplast fusion or, for viral or 
phage vectors, by infection with the recombincuit virus or phage. 
5. Transgenic Animal Models 
5 The present invention also provides for the production of 

transgenic non-human animal models for the study of Alzheimer's 
Disease, for the screening of candidate pharmaceutical compounds, 
for the creation of explanted mammalicui CNS cell cultures (e.g., 
neuronal, glial, organotypic or mixed cell cultures) in which 

10 mutant or wild type presenilin sequences are expressed or in 
which the presenilin genes has been inactivated (e.g., **knock- 
out" deletions), and for the ^valuation of potential therapeutic 
interventions. Prior to the present invention, a partial animal 
model for Alzheimer's Disease existed via the insertion ajid over- 

15 egression of a mutant form of the human amyloid precursor 

protein gene as a minigene under the regulation of the platelet - 
derived growth factor /5 receptor promoter element (Games et al., 
1995) . This mutant (^APP^i^ Val-»Ile) causes the appearance of 
synaptic pathologry and amyloid peptide deposition in the brain 

20 of transgenic animals bearing this transgene in high copy number. 
These changes in the brain of the transgenic animal are very 
similar to that seen in human AD (Games et al., 1995) . It is, 
however, as yet unclear whether these animals become demented, 
but there is general consensus that it is now possible to 

25 recreate at least some aspects of AD in mice. 

Animal species which suitable for use in the animal models 
of the present invention include, but are not limited to, rats, 
mice, hamsters, guinea pigs, rabbits, dogs, cats, goats, sheep, 
pigs, and non-human primates (e.g.. Rhesus monkeys, chimpanzees). 

30 For initial studies, transgenic rodents (e.g., mice) ar^ 

preferred due to their relative ease of maintenance and shorter 
life spans. Indeed, as noted above, transgenic yeast or 
invertebrates (e.g., nematodes, insects) may be preferred for 
some studies because they will allow for even more rapid and 

35 inexpensive screening. Transgenic non-humcui primates, however, 
may be preferred for longer term studies due to their greater 
similarity to humcuis and their higher cognitive abilities. 

Using the nucleic acids disclosed and otherwise enabled 
herein, there are now several availedsle approaches for the 

40 creation of a tratnsgenic animal model for Alzheimer's Disease. 
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Thus, the enabled animal models include (1) Animals in which a 
normal hiaman presenilin gene has been recombinantly introduced 
into the genome of the animal as an additional gene, under the 
regulation of either an exogenous or an endogenous promoter 
5 element, and as either a minigene or a large genomic fragment; in 
which a normal hiiman presenilin gene has been recombinantly 
substituted for one or both copies of the animal's homologous 
presenilin gene by homologous recombination or gene targeting; 
and/or in which one or both copies of one of the animal's 

10 homologous presenilin genes have been recombinantly "humanized" 
by the partial substitution of sequences encoding the human 
homologue by homologous recombination or gene targeting . These 
animals are useful for evaluating the effects of the transgenic 
procedures, and the effects of the introduction or substitution 

15 of a human or humanized presenilin gene. (2) Animals in which a 
mutant (i.e., pathogenic) human presenilin gene has been 
recombinantly introduced into the genome of the animal as an 
additional gene, under the regulation of either an exogenous or 
an endogenous promoter element, and as either a minigene or a 

20 large genomic fragment; in which a mutant human presenilin gene 
has been recombinantly substituted for one or both copies of the 
animal's homologous presenilin gene by homologous recombination 
or gene targeting; and/or in which one or both copies of one of 
the animal's homologous presenilin genes have been recombinantly 

25 "humauiized" by the partial substitution of sequences encoding a 
mutant human homologue by homologous recombination or gene 
targeting. These animals are useful as models which will display 
some or all of the characteristics, whether at the biochemical, 
physiological auid/or behavioral level, of humsms carrying one or 

30 more alleles which are pathogenic of Alzheimer's Disease or other 
diseases associated with mutations in the presenilin genes. (3) 
Animals in which a mutamt version of one of that animal's 
presenilin genes (bearing, for example, a specific mutation 
corresponding to, or similar to, one of the pathogenic mutations 

35 of the htiman presenilins) has been recombinantly introduced into 
the genome of the animal as an additional gene, under the 
regulation of either an exogenous or an endogenous promoter 
element, auid as either a minigene or a large genomic fragment; 
and/or in which a mutant version of one of that animal's 

40 presenilin genes (bearing, for example, a specific mutation 



SUBSTITUTE SHEET (RULE 26) 



wo 96/34099 



PCT/CA96/00263 



- 61 - 

corresponding to, or similar to, one of the pathogenic mutations 
of the human presenilins) has been recombinamtly substituted for 
one or both copies of the animal's homologous presenilin gene by 
homologous recombination or gene targeting. These animals are 
5 also useful as models which will display some or all of the 

characteristics, whether at the biochemical, physiological and/or 
behavioral level, of humans carrying one or more alleles which 
are pathogenic of Alzheimer's Disease. (4) "Knock-out" animals 
in which one or both copies of one of the animal's presenilin 

10 genes have been partially or completely deleted by homologous 

recombination or gene targeting, or have been inactivated by the 
insertion or substitution by homologous recombination or gene 
targeting of exogenous sequences (e.g., stop codons, lox p 
sites) . Such animals are useful models to study the effects 

15 which loss of presenilin gene expression may have, to evaluate 

whether loss of function is preferable to continued expression of 
mutant forms, and to examine whether other genes can be recruited 
to replace a mutant presenilin (e.g., substitute PSl with PS2) or 
to intervene with the effects of other genes (e.g., APP or ApoE) 

20 causing AD as a treatment for i\D or other disorders. For 

example, a normal presenilin gene may be necessary for the action 
of mutant APP genes to actually be expressed as AD and, 
therefore, transgenic presenilin cuiimal models may be of use in 
elucidating such multigenic interactions. 

25 To create an animal model (e.g., a transgenic mouse) , a 

normal or mutant presenilin gene (e.g., normal or mutant hPSl, 
mPSl, hPS2, mPS2, etc.), or a normal or mutant version of a 
recombinant nucleic acid encoding at least a functional domain of 
a presenilin (e.g., a recombinant construct comprising an mPSl 

30 sequence into which has been svibstituted a nucleotide sequence 

corresponding to a hixman mutant sequence) can be inserted into a 
germ line or stem cell using standard techniques of oocyte 
microinjection, or transfection or microinjection into embryonic 
stem cells. Animals produced by these or similar processes are 

35 referred to as trauisgenic. Similarly, if it is desired to 

inactivate or replace an endogenous presenilin gene, homologous 
recombination using embryonic stem cells may be employed. 
Animals produced by these or similar processes are referred to as 
••knock-out" (inactivation) or "knock- in" (replacement) models. 

40 For oocyte injection, one or more copies of the recombinant 
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DNA constructs of the present invention may be inserted into the 
pronucleus of a just-fertilized oocyte. This oocyte is then 
reimplanted into a pseudo-pregnant foster mother. The livebom 
animals are screened for integrants using analysis of DNA (e.g., 
5 from the tail veins of offspring mice) for the presence of the 
inserted recombinant transgene sequences. The transgene may be 
either a complete genomic sequence injected as a YAC, BAC, PAC or 
other chromosome DNA fragment, a cDNA with either the natural 
promoter or a heterologous promoter, or a minigene containing all 

10 of the coding region and other elements found to be necessary for 
optiimxm expression. 

Retroviral infection of early embryos can also be done to 
insert the recombinant DNA constructs of the invention. In this 
method, the transgene (e.g., a normal or mutant hPSl or PS2 

15 sequence) is inserted into a retroviral vector which is used to 
infect embryos (e.g., mouse or non-human primate embryos) 
directly during the early stages of development to generate 
chimeras, some of which will lead to germline transmission. 

Homologous recombination using stem cells allows for the 

20 screening of gene transfer cells to identify the rare homologous 
recombination events. Once identified, these can be used to 
generate chimeras by injection of blastocysts, and a proportion 
of the resulting animals will show germline transmission from the 
recombinant line. This methodology is especially useful if 

25 inactivation of a presenilin gene is desired. For example, 
inactivation of the mPSl gene in mice may be accomplished by 
designing a DNA fragment which contains sequences from an mPSl 
exon flanking a selectable mar)cer.- Homologous recombination 
leads to the insertion of the marker sequences in the middle of 

30 an exon, causing inactivation of the mPSl gene and/or deletion of 
internal sequences. DNA analysis of individual clones can then 
be used to recognize the homologous recombination events. 

The techniques of generating transgenic animals, as well as 
the techniques for homologous recombination or gene targeting, 

35 are now widely accepted and practiced. A laboratory manual on 
the manipulation of the mouse embryo, for example, is available 
detailing standard laboratory techniques for the production of 
transgenic mice (Hogan et al., 1986). To create a transgene,. the 
target sequence of interest (e.g., mutant or wild-type presenilin 

40 sequences) are typically ligated into a cloning site located 
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downstream of some promoter element which will regulate the 
expression of RNA from the presenilin sequence. Downstream of 
the presenilin sequence, there is typically an artificial 
polyadenylation sequence. In the transgenic models that have 
5 been used to successfully create animals which mimic aspects of 
inherited human neurodegenerative diseases, the most successful 
promoter elements have been the platelet -derived growth factor 
receptor P gene subiinit promoter and the hamster prion protein 
gene promoter, although other promoter elements which direct 

10 expression in central nervous system cells would also be useful. 
An alternate approach to creating a trsuisgene is to use an 
endogenous presenilin promoter and regulatory sequences to drive 
expression of the presenilin transgene. Finally, it is possible 
to create trauisgenes using large genomic DNA fragments such as 

15 YACs which contain the entire presenilin gene as well as its 
appropriate regulatory sequences. Such constructs have been 
successfully used to drive humeui APP expression in transgenic 
mice (Lamb et al., 1993). 

Animal models can also be created by targeting the 

20 endogenous presenilin gene in order to alter the endogenous 

presenilin sequence by homologous recombination. These targeting 
events can have the effect of removing endogenous sequence 
(knock-out) or altering the endogenous sequence to create an 
amino acid change associated with human disease or an otherwise 

25 abnormal sequence (e.g., a sequence which is more like the human 
sequence than the original animal sequence) (knock- in animal 
models) . A large number vectors are avail«Q)le to accomplish this 
and appropriate sources of genomic DNA for mouse and other animal 
genomes to be targeted are commercially available from compcuxies 

30 such as GenomeSystems Inc. (St. Louis, Missouri, USA) • The 

typical feature of these targeting vector constructs is that 2 to 
4 kb of genomic DNA is ligated 5' to a selectaUble marker (e.g., a 
bacterial neomycin resistance gene under its own promoter element 
termed a "neomycin cassette" ) . A second DNA fragment from the 

35 gene of interest is then ligated downstream of the neomycin 
cassette but upstream of a second selectable marker (e.g., 
thymidine kinase) . The DNA fragments are chosen such that mutant 
sequences can be introduced into the germ line of the targeted 
animal by homologous replacement of the endogenous sequences by 

40 either one of the sequences included in the vector. 
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Alternatively, the sequences can be chosen to cause deletion of 
sequences that would normally reside between the left smd right 
arms of the vector surrounding the neomycin cassette. The former 
is known as a knock-in, the latter is known as a knock-out. 
5 Again, innumerable model systems have been created, particularly 
for targeted knock-outs of genes including those relevant to 
neurodegenerative diseases (e.g., targeted deletions of the 
murine APP gene by Zheng et al., 1995; targeted deletion of the 
murine prion gene associated with adult onset human CNS 

10 degeneration by Bueler et al., 1996). 

Finally, equivalents of transgenic animals, including 
animals with mutated or inactivated presenilin genes, may be 
produced using chemical or x-ray mutagenesis of gametes, followed 
by fertilization. Using the isolated nucleic acids disclosed or 

15 otherwise enabled herein, one of ordinary skill may more rapidly 
screen the resulting offspring by, for example, direct sequencing 
RFLP, PCR, or hybridization analysis to detect mutauits, or 
Southern blotting to demonstrate loss of one allele by dosage. 
6. Assays for Drugs Which Affect Presenil in Expression 

20 In another series of embodiments, the present invention 

provides assays for identifying small molecules or other 
compounds which are capable of inducing or inhibiting the 
expression of the presenilin genes and proteins (e.g., PSl or 
PS2) . The assays may be performed in vitro using non- trsmsf ormed 

25 cells, immortalized cell lines, or recombinant cell lines, or in 
vivo using the transgenic animal models enabled herein. 

In particular, the assays may detect the presence of 
increased or decreased- expression of PSl, PS2 or other 
presenilin- related genes or proteins on the basis of increased or 

30 decreased mRNA expression (using, e.g., the nucleic acid probes 
disclosed and enabled herein) , increased or decreased levels of 
PSl, PS2 or other presenilin- related protein products (using, 
e.g., the anti -presenilin antibodies disclosed and enabled 
herein) , or increased or decreased levels of expression of a 

35 marker gene (e.g., /3-galactosidase or luciferase) operably joined 
to a presenilin 5' regulatory region in a recombinant construct. 

Thus, for example, one may culture cells known to express a 
particular presenilin and add to the culture medium one or more 
test compounds. After allowing a sufficient period of time 

40 (e.g., 0-72 hours) for the compound to induce or inhibit the 
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expression o£ the presenilin, any change in levels of expression 
from an established baseline may be detected using any of the 
techniques described above and well known in the art. In 
particularly preferred embodiments, the cells are from an 
5 immortalized cell line such as a human neuroblastoma, 

glioblastoma or a hybridoma cell line. Using the nucleic acid 
probes and /or antibodies disclosed and enabled herein, detection 
of changes in the expression of a presenilin, and thus 
identification of the compoimd as an inducer or repressor of 

10 presenilin expression, requires only routine experimentation. 

In particularly preferred embodiments, a recombinant assay 
is employed in which a reporter gene such a /3-galactosidase, 
green fluorescent protein , alkaline phosphatase, or lucif erase 
is opercUDly joined to the 5' regulatory regions of a presenilin 

15 gene. Preferred vectors include the Green Lantern 1 vector 

(GIBCO/BRL, Gaithersburg, ND and the Great EScAPe pSEAP vector 
(Clontech, Palo Alto) • The hPSl regulatory regions disclosed 
herein, or other presenilin regulatory regions, may be easily 
isolated and cloned by one of ordinary skill in the art in light 

20 of the present disclosure of the coding regions of these genes. 
The reporter gene and regulatory regions are joined in-frame (or 
in each of the three possible reading frames) so that 
transcription and translation of the reporter gene may proceed 
under the control of the presenilin regulatory elements. The 

25 recombinant construct may then be introduced into any appropriate 
cell type although mammalian cells are preferred, and human cells 
are most preferred. The transformed cells may be grown in 
culture and, after establishing the baseline level of expression 
of the reporter gene, test compounds may be added to the medium. 

30 The ease of detection of the expression of the reporter gene 
provides for a rapid, high through-put assay for the 
identification of inducers and repressors of the presenilin gene. 

Compounds identified by this method will have potential 
utility in modifying the expression of the PSl, PS2 or other 
35 presenilin- related genes in vivo . These compoxindiS may be further 
tested in the animal models disclosed and enabled herein to 
identify those compoxinds having the most potent in vivo effects. 
In addition, as described herein with respect to small molecules 
having presenilin-binding activity, these molecules may serve as 
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"lead compounds" for the further development of pharmaceuticals 
by, for example, subjecting the compounds to sequential 
modifications, molecular modeling, and other routine procedures 
employed in rational drug design. 
5 7. Identifiq^tion of Compounds with Presenilin Binr jino Capagit-v 
In light of the present disclosure, one of ordinary skill in 
the art is enabled to practice new screening methodologies which 
will be useful in the identification of proteins and other 
compounds which bind to, or otherwise directly interact with, the 

10 presenilins. The proteins and compounds will include endogenous 
cellular coii^onents which interact with the presenilins in vivo 
and which, therefore, provide new targets for pharmaceutical and 
therapeutic interventions, as well as recombinant, synthetic and 
otherwise exogenous compoiinds which may have presenilin binding 

15 capacity eind, therefore, may be candidates for pharmaceutical 
agents. Thus, in one series of embodiments, cell lysates or 
tissue homogenates (e.g., human brain homogenates, lymphocyte 
lysates) may be screened for proteins or other compounds which 
bind to one of the normal or mutant presenilins. Alternatively, 

20 any of a variety of exogenous compounds, both naturally occurring 
and/or synthetic (e.g., libraries of small molecules or 
peptides) , may be screened for presenilin binding capacity. 
Small molecules are particular preferred in this context becatise 
they are more readily absorbed after oral administration, have 

25 fewer potential antigenic determinants, and/ or are more likely to 
cross the blood brain barrier than larger molecules such as 
nucleic acids or proteins. The methods of the present invention 
are particularly useful in that they may be used to identify 
molecules which selectively or preferentially bind to a mutant 

30 form of a presenilin protein (rather than a normal form) and, 
therefore, may have particular utility in treating the 
heterozygous victims of this dominant autosomal disease. 

Because the normal physiological roles of PSl and PS2 are 
still unknown, compounds which bind to normal, mutant or both 

35 forms of these presenilins may have utility in treatments and 
diagnostics. Compounds which bind only to a normal presenilin 
may, for example, act as enhancers of its normal activity and 
thereby at least partially compensate for the lost or abnormal 
activity of mutant forms of the presenilin in Alzheimer's Disease 

40 victims. Compounds which bind to both normal and mutauit forms of 
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a presenilin may have utility if they differentially affect the 
activities of the two forms so as to alleviate the overall 
departure from normal function. Alternatively, blocking the 
activity of both normal and mutant forms of either PSl or PS2 may 
5 have less severe physiological and clinical consequences than the 
normal progress of the disease and, therefore, compounds which 
bind to and inhibit the activity of both normal and mutant forms 
of a presenilin may be therapeutically xiseful. Preferably, 
however, compounds are identified which have a higher affinity of 
10 binding to mutant presenilin than to normal presenilin (e.g., at 
least 2-10 fold higher I^) and which selectively or preferentially 
inhibit the activity of the mutant form. Such compoimds may be 
identified by using any of the techniques described herein and by 
then comparing the binding affinities of the candidate 
,15 compound(s) for the normal and mutant forms of PSl or PS2. 

The effect of agents which bind to the presenilins (normal 
or mutant forms) can be monitored either by the direct monitoring 
of this binding using instruments (e.g., BlAcore, LKB Pharmacia, 
Sweden) to detect this binding by, for example, a change in 
20 fluorescence, molecular weight, or concentration of either the 
binding agent or presenilin component, either in a soluble phase 
or in a substrate-bound phase. 

Once identified by the methods described above, the 
candidate compounds may then be produced in quantities sufficient 
25 for pharmaceutical administration or testing (e.g., fig or mg or 
greater quantities), and formulated in a pharmaceutically 
acceptable carrier (see, e.g.. Remington ' s Pharmaceutical 
Sciences, Gennaro. A., ed.-. Mack Pub; , 1990) , These candidate 
compounds may then be administered to the transformed cells of 
30 the invention, to the transgenic animal models of the invention, 
to cell lines derived from the animal models or from human 
patients, or to Alzheimer's patients. The animal models 
described and enabled herein are of particular utility in further 
testing candidate compounds which bind to normal or mutamt 
35 presenilin for their therapeutic efficacy. 

In addition, once identified by the methods described above, 
the ceuididate compounds may also serve as *lead compounds'* in the 
design and development of new pharmaceuticals. For example, as 
in well known in the art, sequential modification of small 
40 molecules (e.g., amino acid residue replacement with peptides; 
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functional group replacement with peptide or non-peptide 
compounds) is a standard approach in the pharmaceutical industry 
for the development of new pharmaceuticals. Such development 
generally proceeds from a **lead compound" which is shown to have 
5 at least some of the activity (e.g., PSl binding or blocking 

ability) of the desired pharmaceutical. In particular, when one 
or more compounds having at least some activity of interest 
(e.g.r modulation of presenilin activity) are identified, 
structural comparison of the molecules can greatly inform the 

10 skilled practitioner by suggesting portions of the lead compounds 
which should be conserved and portions which may be varied in the 
design of new candidate confounds. Thus, the present invention 
also provides a mesois of identifying lead compounds which may be 
sequentially modified to produce new candidate corapoiinds for use 

15 in the treatment of Alzheimer's Disease. These new compounds 
then may be tested both for presenilin-binding or blocking 
(e.g., in the binding assays described above) and for therapeutic 
efficacy (e.g., in the animal models described herein) . This 
procedure may be iterated until compounds having the desired 

20 therapeutic activity and/or efficacy are identified. 

In each of the present series of embodiments, an assay is 
conducted to detect binding between a "presenilin component** and 
some other moiety. Of particular utility will be sequential 
assays in which compounds are tested for the ability to bind to 

25 only the normal or only the mutauxt forms of the presenilin 

functional domains using mutant and normal presenilin components 
in the -bin d ing assays. Such compounds are expected to have the 
greatest therapeutic utilities, as described more fully below. 
The "presenilin component" in these assays may be a complete 

30 normal or mutant form of a presenilin protein (e.g., an hPSl or 
hPS2 variant) but need not be. Rather, particular functional 
domains of the presenilins, as described above, may be employed 
either as separate molecules or as part of a fusion protein. For 
example, to isolate proteins or compounds that interact with 

35 these functional domains, screening may be carried out using 
fusion constructs and/or synthetic peptides corresponding to 
these regions. Thus, for PS2, GST- fusion peptides may be made 
including sequences corresponding approximately to amino acids 1 
to 87 (N-terminus) , or 269-387 (TM6-#7 loop), or to any other 

40 conserved domain of interest. For shorter functional domains, a 
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synthetic peptide may be produced corresponding, for example, 
approximately to amino acids 107 to 134 (TMl-»2 loop) . Similarly, 
for PSl, GST- or other fusion peptides may be produced including 
sequences corresponding approximately to amino acids l to 81 (N- 
5 terminus) or 266 to 410 (TM6-»7 loop) or a synthetic peptide may 
be produced corresponding approximately to amino acids 101 to 131 
(TMl-»2 loop) - Obviously, various combinations of fusion proteins 
and presenilin f\inctional domains are possible and these are 
merely examples. In addition, the functional domains may be 

10 altered so as to aid in the assay by, for exanqple, introducing 

into the fiinctional domain a reactive group or amino acid residue 
(e.g., cysteine) which will facilitate immobilization of the 
domain on a substrate (e.g., using sulfhydryl reactions). Thus, 
for example, the PSl TMl-»2 loop fragment (3l-mer) has been 

IS synthesized containing an additional C-terminal cysteine residue. 
This peptide will be used to create an affinity substrate for 
affinity chromatography (Sulfo-link; Pierce) to isolate binding 
proteins for microsec[uencing. Similarly, other functional domain 
or antigenic fragments may be created with modified residues 

20 (see, e.g., Example 10). 

The proteins or other compounds identified by these methods 
may be purified and characterized by any of the standard methods 
known in the art. Proteins may, for example, be purified and 
separated using electrophoretic (e.g., SDS-PAGE, 2D PAGE) or 

25 chromatographic (e.g., HPLC) techniques and may then be 

microsequenced. For proteins with a blocked N- terminus, cleavage 
(e.g., by CNBr and/or trypsin) of the particular binding protein 
is used to. release peptide fragments. Further 

purification/characterization by HPLC and microsequencing and/or 
30 mass spectrometry by conventional methods provides Internal 
sequence data on such blocked proteins. For non-protein 
compounds, standard organic chemical analysis techniques (e.g., 
IR, NMR and mass spectrometry; functional group analysis; X-ray 
crystallography) may be employed to determine their structure and 
35 identity. 

Methods for screening cellular, lysates, tissue homogenates, 
or small molecule libraries for candidate presenilin-binding 
molecules are well known in the art and, in light of the present 
disci sure, may now be employed to identify compounds which bind 
40 to normal or mutant presenilin components or which modulate 
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presenilin activity as defined by non-specific measures {e.g., 
changes in intracellular Ca^% GTF/GDP ratio) or by specific 
measures (e.g., changes in peptide production or chemges in 
the expression of other downstream genes which can be monitored 
5 by differential display, 2D gel electrophoresis, differential 
hybridization, or SAGE methods) . The preferred methods involve 
variations on the following techniques: (1) direct extraction 
by affinity chromatography; (2) co-isolation of presenilin 
components and bound proteins or other compounds by 

10 imraimoprecipitation; (3) the Biomolecular Interaction Assay 
{BIAcore) ; and (4) the yeast two-hybrid systems. These and 
others are discussed separately below. 
A. Affinity Chromatography 

In light of the present disclosure, a variety of affinity 

15 binding techniques well )cnown in the art may be employed to 

isolate proteins or other compounds which bind to the presenilins 
disclosed or otherwise enabled herein. In general, a presenilin 
component may be immobilized on a substrate (e.g., a column or 
filter) and a solution including the test compound(s) is 

20 contacted with the presenilin protein, fusion or fragment \mder 
conditions which are permissive for binding. The siibstrate is 
then washed with a solution to remove lanbound or weakly botind 
molecules. A second wash may then elute those compoxinds which 
strongly bound to the immobilized normal or mutant presenilin 

25 component. Alternatively, the test compounds may be immobilized 
and a solution containing one or more presenilin components may 
be contacted with the column, filter or other substrate. The 
ability of the presenilin component to bind to the test compoimds 
may be determined as above or a labeled form of the presenilin 

30 component (e.g., a radio-labeled or chemiluminescent functional 
domain) may be used to more rapidly assess binding to the 
substrate- immobilized compound (s) . In addition, as both PSl and 
PS2 are believed to be membrane associated proteins, it may be 
preferred that the presenilin proteins, fusion or fragments be 

35 incorporated into lipid bilayers (e.g., liposomes) to promote 
their proper folding. This is particularly true when a 
presenilin component including at least one transmembrane domain 
is employed. Such presenilin- liposomes may be immobilized on 
substrates (either directly or by means of another element in the 

40 liposome membrane) , passed over sxibstrates with immobilized test 
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compounds, or used in any of a variety of other well known 
binding assays for membrane proteins. Alternatively, the 
presenilin component may be isolated in a membrane fraction from 
cells producing the component, and this membrane fraction may be 
5 used in the binding assay. 
B. Co-Immunopreeipitation 

Another well characterized technique for the isolation of 
the presenilin components and their associated proteins or other 
compoxmds is direct immunoprecipitation with antibodies. This 
10 procedure has been successfully used, for example, to isolate 
many of the synaptic vesicle associated proteins (Phizicky and 
Fields, 1994). Thus, either normal or mutant, free or membrane- 
bound presenilin components may be mixed in a solution with the 
candidate compound (s) under conditions which are permissive for 
15 binding, and the presenilin component may be immunoprecipitated* 
Proteins or other compounds which co-immunoprecipitate with the 
presenilin component may then be identified by standard 
techniques as described above. General techniques for 
immunoprecipitation may be foxmd in, for example, Harlow and 
20 Lane, (1988) Antibodies: A Laboratory Manual. Cold Spring Harbor 
Press, Cold Spring Harbor, NY. 

The antibodies employed in this assay, as described and 
enabled herein, may be polyclonal or monoclonal, and include the 
various antibody fragments (e.g.. Fab, FCab'),,) as well as single 
25 chain antibodies, and the like. 

C. The Bioraolecular Interaction Afigay 

Another useful method for the detection and isolation of 
binding proteins is the Biomolecular Interaction Assay or 
"BIAcore" system developed by Pharmacia Biosensor and described 
30 in the manufacturer's protocol (LKB Pharmacia, Sweden). In light 
of the present disclosure, one of ordinary skill in the art is 
now enabled to employ this system, or a sxibstauatial equivalent, 
to identify proteins or other compounds having presenilin binding 
capacity. The BIAcore system uses an affinity purified anti-GST 
35 antibody to immobilize GST-fusion proteins onto a sensor chip. 

Obviously, other fusion proteins and corresponding antibodies may 
be substituted. The sensor utilizes surface plasmon resonance 
which is an optical phenomenon that detects changes in refractive 
indices . A homogenate of a tissue of interest is passed over the 
40 immobilized fusion protein and protein-protein interactions are 
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registered as changes in the refractive index. This system can 
be used to determine the kinetics of binding and to assess 
whether any observed binding is of physiological relevance. 

D. The Yeast Two-Hvbrid Svstem 

5 The yeast "two-hybrid" system takes advantage of 

transcriptional factors that are composed of two physically 
separable, f\anctional domains (Phizicky euid Fields, 1994} . The 
most commonly used is the yeast GAL4 transcriptional activator 
consisting of a DNA binding domain and a transcriptional 

10 activation domain. Two different cloning vectors are used to 
generate separate fusions of the GAL4 domains to genes encoding 
potential binding proteins. The fusion proteins are co- 
expressed, targeted to. the nucleus and, if interactions occur, 
activation of a reporter gene (e.g., lacZ) produces a detectable 

15 phenotype. For example, the Clontech Matchmaker System-2 may be 
used with the Clontech brain cDNA GAL4 activation domain fusion 
library with presenilin-GAL4 binding domain fusion clones 
(Clontech, Palo Alto, CA) . In light of the disclosures herein, 
one of ordinary skill in the art is now enabled to produce a 

20 variety of presenilin fusions, including fusions including either 
normal or mutant functional domains of the presenilin proteins, 
and to screen such fusion libraries in order to identify 
presenilin binding proteins. 

E. Other Methods 

25 The nucleotide sequences and protein products, including 

both mutant and normal forms of these nucleic acids and their 
corresponding proteins, can be used with the aO^ove techniques to 
isolate other interacting proteins, and to identify other genes 
whose expression is altered by the over- express ion of normal 

30 presenilin sequences, by the under -express ion of normal 

presenilins sequences, or by the expression of mutant presenilin 
sequences. Identification of these interacting proteins, as well 
as the identification of other genes whose expression levels are 
altered in the face of mutant presenilin sequences (for instance) 

35 will identify other gene targets which have direct relevance to 
the pathogenesis of this disease in its clinical or pathological 
forms. Specifically, other genes will be identified which may 
themselves be the site of other mutations causing Alzheimer's 
Disease, or which can themselves be targeted therapeutically 

40 (e.g., to reduce their expression levels to normal or to 
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pharmacologically block the effects of their over-expression) as 
a potential treatment for this disease. Specifically, these 
techniques rely on PCR-based and/or hybridization-based methods 
to identify genes which are differentially expressed between two 
5 conditions (a cell line expressing normal presenilins compared to 
the same cell type expressing a mutant presenilin seG[uence) . 
These technic|ues include differential display, serial analysis of 
gene expression (SAGE) , atnd mass -spectrometry of protein 2D-gels 
and subtractive hybridization (reviewed in Nowak, 1995 euid Kahn, 
10 1995) . 

As will be obvious to one of ordinary skill in the art, 
there are numerous other methods of screening individual proteins 
or other compounds, as well as large libraries of proteins or 
other compounds (e.g., phage display libraries and cloning 

15 systems from Stratagene, La Jolla, CA) to identify molecules 
which bind to normal or mutant presenilin components. All of 
these methods comprise the step of mixing a normal or mutant 
presenilin protein, fusion, or fragment with test compounds, 
allowing for binding (if any), and assaying for bound complexes. 

20 All such methods are now enabled by the present disclosure of 
sul)stantially pure presenilins, substantially pure presenilin 
fimctional domain fragments, presenilin fusion proteins, 
presenilin antibodies, auid methods of making and using the same. 
8. Methods of Identifying Compounds Modulating Presenilin 

25 Activity 

In another series of embodiments, the present invention 
provides for methods of identifying compounds with the ability to 
modulate the activity of normal and mutant presenilins- As used 
with respect to this series of embodiments, the term dactivity6 

30 broadly includes gene cuid protein expression, presenilin protein 
post-trwslation processing, trafficking and localization, and 
any functional activity (e.g., enzymatic, receptor-effector, 
binding, channel), as well as downstream affects of any of these. 
The presenilins appear to be integral membrane proteins normally 

35 associated with the endoplasmic reticulum and/or Golgi apparatus 
and may have fvtnctions involved in the transport or trafficking 
of APP and/or the regulation of intracellular calcium levels. In 
addition, it is known that presenilin mutations are associated 
with the increased production of A0 peptides, the appearance of 

40 amyloid plaques and neurofibrillary tangles, decreases in 
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cognitive function, and apoptotic cell death. Therefore, using 
the transformed cells and transgenic animal models of the present 
invention, cells obtained from subjects bearing a mutant 
presenilin gene, or animals or human subjects bearing naturally 
5 occurring presenilin mutations, it is now possible to screen 
candidate pharmaceuticals and treatments for their therapeutic 
effects by detecting changes in one or more of these functional 
characteristics or phenotypic manifestations of normal or mutant 
presenilin expression. 

10 Thus, the present invention provides methods for screening 

or assaying for proteins, small molecules or other compounds 
which modulate presenilin activity by contacting a cell . in vivo 
or in vitro with a candidate compoxind and assaying for a change 
in a marker associated with normal or mutant presenilin activity. 

15 The marker associated with presenilin activity may be any 
measurable biochemical, physiological, histological and/ or 
behavioral characteristic associated with presenilin expression. 
In particular, useful markers will include any measurable 
biochemical, physiological, histological and/or behavioral 

20 characteristic which distinguishes cells, tissues, animals or 
individuals bearing at least one mutant presenilin gene from 
their normal counterparts. In addition, the marker may be any 
specific or non-specific measure of presenilin activity. 
Presenilin specific measures include measures of presenilin 

25 expression (e.g., presenilin mRNA or protein levels) which may 
employ the^ nucleic acid probes or antibodies of the present 
invention. Non-specific measures include changes in cell 
physiology such as pH, intracellular calcium, cyclic AMP levels, 
GTP/GDP ratios, phosphatidyl inositol activity, protein 

3.0 phosphorylation, etc., which can be monitored on devices such as 
the cytoaensor microphysiometer (Molecular Devices Inc., United 
States) . The activation or inhibition of presenilin activity in 
its mutant or normal form can also be monitored by examining 
changes in the expression of other genes which are specific to 

35 the presenilin pathway leading to Alzheimer's Disease. These can 
be assayed by such techniques as differential display, 
differential hybridization, and SAGE (sequential analysis of 
gene expression) , as well as by two dimensional gel 
electrophoresis of cellular lysates. In each case, the 

40 differentially-expressed genes can be ascertained by inspection 
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of identical studies before and after application of the 
ceuididate compound. Furthermore, as noted elsewhere, the 
particular genes whose expression is modulated by the 
administration of the candidate compound can be ascertained by 
5 cloning^ nucleotide sequencing, amino acid sequencing, or mass 
spectrometry (reviewed in Nowak, 1995). 

In general, a cell may be contacted with a candidate 
compound and, after an appropriate period (e.g., 0-72 hours for 
most biochemical measures of cultured cells) , the mar)cer of 

10 presenilin activity may be assayed and compared to a baseline 
measurement. The baseline measurement may be made prior to 
contacting the cell with the candidate compound or may be an 
external baseline estaUslished by other experiments or known in 
the art. The cell may be a transformed cell of the present 

15 invention or an explant from an animal or individual. In 
particular, the cell may be an explant from a carrier of a 
presenilin mutation (e.g., a human subject with AlzheimerSs 
Disease) or an animal model of the invention (e.g., a transgenic 
nematode or mouse bearing a mutant presenilin gene) . To augment 

20 the effect of presenilin mutations on the Afi pathway, transgenic 
cells or animals may be employed which have increased 
production. Preferred cells include those from neurological 
tissues such as neuronal, glial or mixed cell cultures; and 
cultured fibroblasts, liver, kidney, spleen, or bone marrow. The 

25 cells may be contacted with the candidate compounds in a culture 
in vitro or may be administered in vivo to a live animal or human 
subject. For live animals or human siibjects, the test compound 
may be administered orally- or by any parenteral route suitable to 
the compotind. For clinical trials of human subjects, 

30 measurements may be conducted periodically (e.g., daily, weekly 
or monthly) for several months or years. 

Because most carriers of presenilin mutations will be 
heterozygous (i.e., bearing one normal and one mutant presenilin 
allele) , compounds may be tested for their ability to modulate 

35 normal as well as presenilin activity. Thus, for example, 

compoxinds which enhance the function of normal presenilins may 
have utility in treating presenilin associated disorders such as 
Alzheimerds Disease. Alternatively, because suppression of the 
activity of both normal cind mutamt presenilins in a heterozygous 

40 individual may have less severe clinical consequences than 
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progression of the associated disease, it may be desired to 
identify compoxind which inactivate or suppress all forms of the 
presenilins. Preferably, however, compounds are identified which 
selectively or specifically inactivate or suppress the activity 
5 of a mutant presenilin without disrupting the function of a 
normal presenilin gene or protein. 

In light of the identification, characterization, and 
disclosure herein of the presenilin genes and proteins, the 
presenilin nucleic acid probes and antibodies, and the presenilin 

10 transformed cells and transgenic animals of the invention, one of 
ordinary skill in the art is now enabled by perform a great 
variety of assays which will detect the modulation of presenilin 
activity by candidate compounds. Particularly preferred aild 
contemplated embodiments are discussed in some detail below. 

15 A. Pyes^^41ir^ Expregsipn 

In one series of embodiments, specific measures of 
presenilin expression are employed to screen candidate compounds 
for their ability to affect presenilin activity. Thus, using the 
presenilin nucleic acids and antibodies disclosed and otherwise 

20 enabled herein, one may use mRNA levels or protein levels as a 
marker for the ability of a candidate compound to modulate 
presenilin activity. The use of such probes and antibodies to 
measure gene and protein expression is well known in the art and 
discussed elsewhere herein. Of particular interest may be the 

25 identification of compounds which can alter the relative levels 
of different splice variants of the presenilins. Many of the 
presenilin mutations associated with AlzheimerSs Disease, for 
example, are located- in the -region of the putative TM6-»7 loop 
which is siibject to alternative splicing in some peripheral 

30 tissues (e.g., white blood cells). Compounds which can increase 
the relative frequency of this splicing event may, therefore, be 
effective in preventing the expression of mutations in this 
region. 

B. Intracellular Localization 

35 In another series of embodiments, compounds may be screened 

for their ability to modulate the activity of the presenilins 
based upon their effects on the trafficking and intracellular 
localization of the presenilins. The presenilins have been seen 
immunocytochemically to be localized in membrane structures 

40 associated with the endoplasmic reticulum and Golgi apparatus. 
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and one presenilin mutant (H163R) , but not others, has been 
visualized in small cytoplasmic vesicles of unknown f\inction. 
Differences in localization of mutant and normal presenilins may, 
therefore, contribute to the etiology of presenilin-related 
5 diseases. Compounds which can affect the localization of the 
presenilins may, therefore, be identified as potential 
therapeutics. Standard techniques known in the art may be 
employed to detect the localization of the presenilins. 
Generally, these techniques will employ the antibodies of the 

10 present invention, and in particular antibodies which selectively 
bind to one or more mutant presenilins but not to normal 
presenilins. As is well known in the art, such auitibodies may be 
labeled by any of a variety of techniques (e.g., fluorescent or 
radioactive tags, labeled secondary smtibodies, avidin-biotin, 

15 etc.) to aid in visualizing the intracellular location of the 
presenilins. The presenilins may be co-localized to particular 
structures, as in known in the art, using auitibodies to markers 
of those structures (e.g., TGN38 for the Golgi, transferrin 
receptor for post-Golgi transport vesicles, LAMP2 for lysosomes) . 

20 Western blots of purified fractions from cell lysates enriched 
for different intracellular membrane bound organelles (e.g., 
lysosomes, synaptosomes, Golgi) may also be employed. In 
addition, the relative orientation of different domains of the 
presenilins across cellular domains may be assayed using, for 

25 example, electron microscopy and antibodies raised to those 
domains . 

B. Ion Reoulation/Metaboliam 

In another series of embodiments, compoxinds may be screened 
for their ability to modulate the activity of the presenilins 

30 based upon measures in intracellular Ca^*, Na^ or levels or 
metabolism. As noted above, the presenilins are membrane 
associated proteins which may serve as, or interact with, ion 
receptors or ion channels. Thus, compounds may be screened for 
their ability to modulate presenilin-related calcium or other ion 

35 metabolism either in vivo or in vitro by measurements of ion 
channel fluxes and/or transmembrane voltage or current fluxes 
using patch clamp, voltage clamp and fluorescent dyes sensitive 
to intracellular calcium or transmembraoie voltage, ion channel 
or receptor function caxx also be assayed by measurements of 

40 activation of second messengers such as cyclic AMP, cGMP tyrosine 
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kinases, phosphates, increases in intracellular Ca^* levels, etc. 
Recombinantly made proteins may also be reconstructed in 
artificial membrane systems to study ion channel conductance and, 
therefore, the 6cell6 employed in such assays may comprise an 
5 artificial membrane or cell. Assays for changes in ion 

regulation or metabolism can be performed on cultured cells 
expressing endogenous normal or mutant presenilins. Such studies 
also can be performed on cells trans feet ed with vectors capable 
of expressing one of the presenilins, or functional domains of 
10 one of the presenilins, in normal or mutant form. In addition, 
the enhance the signal measured in such assays, cells may be co- 
transfected with genes encoding ion channel proteins. For 
example, X^y^op^s oocytes or rat kidney (HEK293) cells may be co- 
trcuisfected with normal or mutant presenilin sequences and 
15 sequences encoding rat brain Na* 01 subunits, reQjbit skeletal 

muscle Ca'* ffl siibunits, or rat heart K" 01 subunits. Changes in 
presenilin- related or presenilin*mediated ion channel activity 
can be measured by two-microelectrode voltage-clamp recordings in 
oocytes or by whole-cell patch-clan^ recordings in HBX29a cells. 
20 c. ApQptgg^g ffK P^flth 

In another series of embodiments, compoxmds may be screened 
for their ability to modulate the activity of the presenilins 
based upon their effects on presenilin-related or presenilin- 
mediated apoptosis or cell death. Thus, for example, baseline 
25 rates of apoptosis or cell death may be established for cells in 
culture, or the baseline degree of neuronal loss at a particular 
age may be established post-mortem for animal models or human 
subjects^ and the -ability of a candidate compound to suppress or 
inhibit apoptosis or cell death may be measured. Cell death may 
30 be measured by standard microscopic techniques (e.g., light 
microscopy) or apoptosis may be measured more specifically by 
characteristic nuclear morphologies or DNA fragmentation patterns 
which create nucleosomal ladders (see, e.g., Gavrieli et al., 
1992; Jacobson et al . , 1993; Vito et al . , 1996). TUNEL may also 
35 be employed to evaluate cell death in brain (see, e.g., Lassmann 
et al., 1995). In preferred embodiments, compo\inds are screened 
for their ability to suppress or inhibit neuronal loss in the 
transgenic animal models of the invention. Transgenic mice 
bearing, for example, a mutant hxaraan, mutant mouse, or humanized 
40 mutant presenilin gene may be employed to identify or evaluate 
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compoimds which may delay or arrest the neurodegeneration 
associated with AlzheimerOs Disease. A similar transgenic mouse 
model, bearing a mutant APP gene, has recently been reported by 
Games et al. (1995). 
5 D. M Pgpt;A49 PirQduytAg?nt 

In another series of embodiments, compounds may be screened 
for their ability to modulate presenilin- related or presenilin- 
mediated changes in AFP processing. The A/S peptide is produced 
in several isoforms resulting from differences in APP processing. 

10 The A/5 peptide is a 39 to 43 amino acid derivative of /SAPP which 
is progressively deposited in diffuse and senile plagues and in 
blood vessels of subjects with AD. In human brain, A0 peptides 
are heterogeneous at both the N- and C-termini. Several 
observations, however, suggest that both the full length and N- 

15 terminal trxmcated forms of the long- tailed A0 peptides ending at 
residue 42 or 43 (i.e., A/31-42/43 and A^x-42/43) have a more 
important role in AD than do peptides ending at residue 40. 
Thus, A/31-42/43 and A^x-42/43 are an early and prominent feature 
of both senile placpies cuid diffuse plac[ues, while peptides ending 

20 at residue 40 (i.e., A/51-40 and A/?x-40) are predominantly 

associated with a subset of mature plaques and with amyloidotic 
blood vessels (see, e.g., Iwatsiibo et al., 1995; Gravina et al., 
1995; Tamaoka et al., 1995; Podlisny et al. 1995). Furthermore, 
the long-tailed isoforms have a greater propensity to fibril 

25 formation, cuid are thought to be more neurotoxic than A/7l*4 0 
peptides (Pike et al., 1993; Hilbich et al., 1991). Finally, 
missense mutations at codon 717 of the jSAPP gene associated with 
early onset FAD result in overproduction of long- tailed A/? in the 
brain of affected mutation carriers, in peripheral cells cuid 

30 plasma of both affected and presymptomatic carriers, auid in cell 
lines transfected with /SAPP^^^ mutemt cDNAs (Tamaoka et al., 1994; 
Suzuki et al., 1994) As described in Example 18 below, we now 
disclose that increased production of the long- forms of the A0 
peptide are also associated with mutations in the presenilin 
35 genes . 

Thus, in one series of embodiments, the present invention 
provides methods for screening candidate compounds for their 
ability to block or inhibit the increased production of long 
isoforms of the A^ peptides in cells or transgenic animals 
40 expressing a mutant presenilin gene. In particular, the present 
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invention provides such methods in which cultured mammalian 
cells, such as brain cells or fibroblasts, have been transformed 
according to the methods disclosed herein, or in which transgenic 
animals, such as rodents or non-human primates, have been 
5 produced by the methods disclosed herein, to express relatively 
high levels of a mutant presenilin.- Optionally, such cells or 
transgenic animals may also be transformed so as to express a 
normal form of the /SAPP protein at relatively high levels. 

In this series of embodiments, the ccuididate compound is 

10 administered to the cell line or transgenic animals (e.g., by 
addition to the media of cells in culture; or by oral or 
parenteral administration to an animal) and, after an appropriate 
period (e.g., 0-72 hours for cells in culture, days or months for 
animal models), a biological sample is collected (e.g., cell 

15 culture supernatant or cell lysate from cells in culture; tissue 
homogenate or plasma from an animal) and tested for the level of 
the long isoforms of the AP peptides. The levels of the peptides 
may be determined in am absolute sense (e.g., nMol/ml) or in a 
relative sense (e.g., ratio of long to short A/? isoforms). The 

20 A/J isoforms may be detected by any means )mown in the art (e.g., 
electrophoretic separation and secjuencing) but, prefersUbly, 
auxtibodies which are specific to the long isoform are employed to 
determine the absolute or relative levels of the A^l -42/43 or 
A^x-42/43 peptides. Candidate pharmaceuticals or therapies which 

25 reduce the aUdsolute or relative levels of these long A0 isoforms, 
particularly in the transgenic euiimal models of the invention, 
are likely to have therapeutic utility in the treatment of 
Alzheimer's Disease, or other disorders caused by presenilin 
mutations or aiberrations in AFP metabolism. 

30 E. Phosphorylation of Microtubule Associated Proteins 

In another series of embodiments, cauididate compo\inds may be 
screened for their ability to modulate presenilin activity by 
assessing the effect of the compound on levels of phosphorylation 
of microtubule associated proteins (MAPs) such as Tau. The 

35 abnormal phosphorylation of Tau and other MAPs in the brains of 
victims of AlzheimerOs Disease is well known in the art. Thus, 
compounds which prevent or inhibit the abnormal phosphorylation 
of MAPS may have utility in treating presenilin associated 
diseases such as AD. As above, cells from normal or mutant 

40 animals or subjects, or the transformed cell lines and animal 
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models of the invention may be employed. Preferred assays will 
employ cell lines or animal models transformed with a mutant 
human or humanized mutant presenilin gene. The baseline 
phosphorylation state of MAPs in these cells may be esteUDlished 
5 and then candidate confounds may be tested for their ability to 
prevent, inhibit or counteract the hyperphosphorylation 
associated with mutants. The phosphorylation state of the MAPs 
may be determined by any standard method known in the art but, 
preferably, antibodies which bind selectively to phosphorylated 
10 or unphosphorylated epitopes are employed. Such antibodies to 
phosphorylation epitopes of the Tau protein are known in the art 
(e.g. , ALZ50) . 

9. Screening and Diaonostics for Alzheimer^ s Disease 
A. General Diagnostic Methods 

15 The presenilin genes and gene products, as well as the 

presenilin-derived probes, primers and antibodies, disclosed or 
otherwise enabled herein, are useful in the screening for 
carriers of alleles associated with Alzheimer's Disease, for 
diagnosis of victims of Alzheimer's Disease, and for the 

20 screening and diagnosis of related presenile and senile 
dementias, psychiatric diseases such as schizophrenia and 
depression, and neurologic diseases such as stroke and cerebral 
hemorrhage, all of which are seen to a greater or lesser extent 
in sytt^tomatic human subjects bearing mutations in the PSl or PS2 

25 genes or in the APP gene. Individuals at risk for Alzheimer's 

Disease, such as those with AD present in the family pedigree, or 
individuals not previously known to be at risk, may be routinely 
screened using probes to. detect the presence of a mutant 
presenilin gene or protein by a variety of techniques. Diagnosis 

30 of inherited cases of these diseases can be accomplished by 
methods based upon the nucleic acids (including genomic and 
mRNA/cDNA sequences), proteins, and/or antibodies disclosed and 
enabled herein, including functional assays designed to detect 
failure or augmentation of the normal presenilin activity and/or 

35 the presence of specific new activities conferred by the mutant 
presenilins. PreferaJdly, the methods and products are based upon 
the human PSl or PS2 nucleic acids, proteins or antibodies, as 
disclosed or otherwise enabled herein. As will be obvious to one 
of ordinary skill in the art, however, the significant 

40 evolutionary conservation of large portions of the PSl and PS2 
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nucleotide and amino acid sequences, even in species as diverse 
as humans, mice, C, alegars, and Drosophila . allow the skilled 
artisan to make use of such non-human presenilin-homologue 
nucleic acids, proteins and antibodies, even for applications 
5 directed toward human or other animal subjects. Thus, for 
brevity of exposition, but without limiting the scope of ' the 
invention, the following description will focus upon uses of the 
human homologues of PSl and PS2. It will be understood, however, 
that homologous sequences from other species, including those 
10 disclosed herein, will be equivalent for many purposes. 

As will be appreciated by one of ordinaury skill in the art, 
the choice of diagnostic methods of the present invention will be 
influenced by the nature of the available biological samples to 
be tested and the nature of the information required. PSl, for 
15 example, is highly expressed in brain tissue but brain biopsies 
are invasive and expensive procedures, particularly for routine 
screening. Other tissues which express PSl at significant 
levels, however, may demonstrate alternative splicing (e.g., 
lymphocytes) and, therefore, PSl mRNA or protein from such cells 
20 may be less informative. Thus, an assay based upon a subject's 
genomic PSl DNA may be the preferred because no information will 
be dependent upon alternative splicing and because essentially 
amy nucleate cells may provide a usable sample. Diagnostics 
based upon other presenilins (e.g., hPS2, mPSl) are subject to 
25 similar considerations: availability of tissues, levels of 

expression in various tissues, and alternative mRNA and protein 
products resulting from alternative splicing. 
B. Protein Based Screens and Diagnostics 

When a diagnostic assay is to be based upon presenilin 
30 proteins, a variety of approaches are possible. For example, 
diagnosis can be achieved by monitoring differences in the 
electrophoretlc mobility of normal and mutant proteins. Such an 
approach will be particularly useful in identifying mutants in 
which charge substitutions are present, or in which insertions, 
35 deletions or substitutions have resulted in a significant change 
in the electrophoretlc migration of the resultant protein. 
Alternatively, diagnosis may be based upon differences in the 
proteolytic cleavage patterns of normal and mutant proteins, 
differences in molar ratios of the various amino acid residues, 
40 or by fimctional assays demonstrating altered foinction of the 
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gene products. 

In preferred embodiments, protein-based diagnostics will 
employ differences in the ability of antibodies to bind to normal 
and mutant presenil in proteins (especially hPSl or hPS2) . Such 
5 diagnostic tests may employ cuitibodies which bind to the normal 
proteins but not to mutant proteins, or vice versa. In 
particular, an assay in which a plurality of monoclonal 
antibodies, each capable of binding to a mutant epitope, may be 
employed. The levels of ant i -mutant antibody binding in a sample 
10 obtained from a test s\xbject (visualized by, for example, 

radiolabelling, ELISA or chemiluminescence) may be compared to 
the levels of binding to a control sample. Alternatively, 
antibodies which bind to normal but not mutant presenilins may be 
employed, and decreases in the level of antibody binding may be 
15 used to distinguish homozygous normal individuals from mutant 
heterozygotes or homozygotes. Such antibody diagnostics may be 
used for in situ immunohistochemistry using biopsy samples of CNS 
tissues obtained antemortera or postmortem, including 
neuropathological structures associated with these diseases such 
20 as neurofibrillary tangles and amyloid plaques, or may be used 
with fluid sanqples such a cerebrospinal fluid or with peripheral 
tissues such as white blood cells. 
C. Nucleic Acid Based Screen s and Diaonosties 

When the diagnostic assay is to be based upon nucleic acids 
25 from a sample, the assay may be based upon mRNA, cDNA or genomic 
DMA. When mRNA is used from a sample, many of the same 
considerations apply with respect to source tissues and the 
possibility- of alternative splicing. That is, there may be 
little or no expression of tramscripts unless appropriate tissue 
30 sources are chosen or available, and alternative splicing may 
result in the loss of some information or difficulty in 
interpretation. However, we have already shown (Sherrington et 
al., 1995; Rogaev, 1995) that mutations in the 5' UTR, 3' UTR, 
open reading frame and splice sites of both PSl and PS2 can 
35 reliably be identified in mRNA/cDNA isolated from white blood 

cells and/or skin fibroblasts- Whether mRNA, cDNA or genomic DNA 
is assayed, standard methods well known in the art may be used to 
detect the presence of a particular sequence either in situ or in 
vi£i2 (see, e.g., Sambrook et al., (1989) Molecular Cloning: A 
40 Laboratory Manual. 2nd ed. , Cold Spring Harbor Press, Cold Spring 
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Harbor, NY) . As a general matter, however, any tissue with 
nucleated cells may be examined. 

Genomic DNA used for the diagnosis may be obtained from body 
cells, such as those present in the blood, tissue biopsy, 
5 surgical specimen, or autopsy material. The DNA may be isolated 
and used directly for detection of a specific sequence or may be 
amplified by the polymerase chain reaction (PGR) prior to 
analysis. Similarly, RNA or cDNA may also be used, with or 
without PGR amplification. To detect a specific nucleic acid 

10 sec[uence, direct nucleotide sequencing, hybridization using 

specific oligonucleotides, restriction enzyme digest and mapping, 
PGR mapping, RNase prot;ection, chemical mismatch cleavage, 
ligase -mediated detection, and various other methods may be 
employed. Oligonucleotides specific to particular sequences can 

15 be chemically synthesized and labeled radioactively or non- 
radioactively (e.g., biotin tags, ethidium bromide) , and 
hybridized to individual samples immobilized on membranes or 
other solid- supports (e.g., by dot-blot or transfer from gels 
after electrophoresis) , or in solution. The presence or absence 

20 of the target sequences may then be visualized using methods such* 
as autoradiography, fluorometry, or colorimetry. These 
procedures can be automated using redundant , short 
oligonucleotides of Jcnown sequence fixed in high density to 
silicon chips. 

25 (1) Appropriate Probes and Primers 

Whether for hybridization, RNase protection, ligase -mediated 
detection, PGR amplification or any other standards methods 
described- herein and well known in the art, a variety of 
subsequences of the presenilin sequences disclosed or otherwise 

30 enabled herein will be useful as probes and/or primers. These 
secjuences or subsequences will include both normal presenilin 
sequences and deleterious mutant sequences. In general, useful 
sequences will include at least 8-9, more preferably 10-50, euid 
most preferaibly 18-24 consecutive nucleotides from the presenilin 

35 introns, exons or intron/exon boundaries. Depending upon the 
target sequence, the specificity required, and future 
technological developments, shorter sequences may also have 
utility. Therefore, any presenilin derived sequence which is 
employed to isolate, clone, amplify, identify or otherwise 

40 manipulate a presenilin sequence may be regarded as an 



SUBSTITUTE SHEET (RULE 26) 



WO9d/34099 



PCT/CA96/002d3 



- 85 - 

appropriate probe or primer. Particularly conteinplated as useful 
will be secjuences including nucleotide positions from the 
presenilin genes in which disease-causing mutations are known to 
be present « or sequences which flank these positions. 
5 (a) PSl Probes and Primers 

As discussed above, a variety of disease- causing mutations 
have now been identified in the huxneui PSl gene. Detection of 
these and other PSl mutations is now enabled using isolated 
nucleic acid probes or primers derived from normal or mutant PSl 

10 genes. Particularly contemplated as useful are probes or primers 
derived from sequences encoding the N- terminus, the TM1-TM2 
region, and the TM6-TM7 region. As disclosed above, however, 
mutations have already been detected which affect other regions 
of the PSl protein and, using the methods disclosed herein, more 

15 will undoubtedly be detected. Therefore, the present invention 
provides isolated nucleic acid probes and primers corresponding 
to normal and mutant sequences from any portion of the PSl gene, 
including introns and 5' and 3' UTRs, which may be shown to be 
associated with the development of Alzheimer's Disease. 

20 Merely as an example, and without limiting the invention, 

probes and primers derived from the hPSl DNA segment immediately 
surrounding the C410Y mutation may be employed in screening and 
diagnostic methods. This mutation arises, at least in some 
individuals, from the siibstitution of em A for a G at position 

25 1477 of SEQ ID NO: 1. Thus, genomic DNA, mRNA or cDNA acquired 
from peripheral blood samples from an individual can be screened 
using oligonucleotide probes or primers including this 
potentially -mutant site. For hybridization. probes for this 
mutation, probes of 8-50, and more preferably 18-24 bases 

30 spanning the mutation site (e.g., bp 1467-1467 of SEQ ID NO: 1) 

may be employed. If the probe is to be used with mRNA, it should 
of course be complementary to the mRNA {and, therefore, 
correspond to the non-coding strand of the PSl gene. For probes 
to be used with genomic DNA or cDNA, the probe may be 

35 complementary to either strand. To detect sequences including 
this mutation by PCR methods, appropriate primers would include 
sequences of 8-50, and preferably 18-24, nucleotides in length 
derived from the regions flanking the mutation on either side, 
and which correspond to positions anywhere from 1 to 1000 bp, but 

40 preferably 1-200 bp, removed from the site of the mutation. PCR 
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primers which are 5' to the mutation site (on the coding strand) 
should correspond in sequence to the coding strand of the PSl 
gene whereas PCR primers which are 3' to the mutation site (on 
the coding strand) should correspond to the non- coding or 
5 antisense strand (e.g., a 5' primer corresponding to bp 1451-1468 
of SEQ ID NO: 1 and a 3' primer corresponding to the complement 
of 719-699 of SEQ ID NO: 14) . 

Similar primers may be chosen for other PSl mutations or for 
the mutational "hot spots" in general. For example, a 5' PCR 
10 primer for the M146L mutation (A-*C at bp 684) may comprise a 

sequence corresponding to approximately bp 601-620 of SEQ ID NO: 

I and a 3' primer may correspond to the complement of 
approximately bp 1328-1309 of SEQ ID NO: 8. Note that this 
example employs primers from both intronic and exonic sequences. 

15 As another example, an appropriate 5' primer for the A246E 

mutation (C-»A at bp 985) may comprise a sequence corresponding to 
approximately bp 907-925 of SEQ ID NO: 1 or a 3 ' primer 
corresponding to the complement of approximately bp 1010-990 of 
SEQ ID NO: 1. As another example, a 5' primer for the H163R 

20 mutation (A-^ at bp 736 of SEQ ID NO: 1 or bp 419 of SEQ ID NO: 
9) comprising a sequence corresponding to approximately bp 354- 
375 of SEQ ID NO: 9 with a 3' primer corresponding to the 
complement of approximately bp 581-559 of SEQ ID NO: 9. 
Similarly, intronic or exonic sequences may be employed, for 

25 example, to produce a 5' primer for the L286V mutation (C-#6 at bp 
1104 of SEQ ID NO: 1 or bp 398 of SEQ ID NOr 11) comprising a 
sequence corresponding to approximately bp 249-268 of SEQ ID NO: 

II or bp 1020-1039 of SEQ ID NO:- !,- and-a 3' primer corresponding 
to the complement of approximately bp 510-491 of SEQ ID NO: 11. 

30 It should also be noted that the probes and primers may 

include specific mutated nucleotides. Thus, for example, a 
hybridization probe or 5' primer may be produced for the C410Y 
mutation comprising a sequence corresponding to approximately bp 
1468-1486 of SEQ ID NO: 1 to screen for or amplify normal 

35 alleles, or corresponding to the same sequence but with the bp 
corresponding to bp 1477 altered (G-»T) to screen for or amplify 
mutant alleles, 
(b) PS2 Probes and Primers 

The same general considerations described above with respect 

40 to probes and primers for PSl, apply equally to probes and 
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primers for PS2. In particular, the probes or primers may 
correspond to intron, exon or intron/exon boundary sequences, may 
correspond to sequences from the coding or non-coding (cuitisense) 
strands, and may correspond to normal or mutant sequences. 
5 Merely as exatt^les, the PSl N141I mutation {A-*T at bp 787) 

may be screened for by PGR amplification of the surrounding DNA 
fragment using a 5' primer corresponding to approximately bp 733- 
751 of SEQ ID NO: 18 and a 3' primer corresponding to the 
complement of approximately bp 846-829 of SEQ ID NO: 18. 

10 Similarly, a 5' primer for the M239V mutation (A-*G at bp 1080) 
may comprise a sequence corresponding to approximately bp 1009- 
1026 and a 3' primer may correspond to the complement of 
approximately bp 1118-1101 of SEQ ID NO: 18. As another example, 
the sequence encoding the region surrounding the I420T mutation 

15 (T-^ at bp 1624) may be screened for by PGR amplification of 

genomic DNA using a 5' primer corresponding to approximately bp 
1576-1593 of SEQ ID NO: 18 and a 3' primer corresponding to the 
complement of approximately bp 1721-1701 of SEQ ID NO: 18 to 
generate a 146 base pair product. This product may, for example, 

20 then be probed with allele specific oligonucleotides for the 
wild-type (e.g., bp 1616-1632 of SEQ ID NO: 18) and/or mutant 
(e.g., bp 1616-1632 of SEQ ID NO: 18 with T-*C at bp 1624) 
sequences . 

(2) Hyb2rj.(^j,:z;ac;L9n gcye^njnq 

25 For in situ detection of a normal or mutant PSl, PS2 or 

other presenilin-related nucleic acid sequence, a sample of 
tissue may be prepared by standard techniques and then contacted 
with one or more of the above-described probes, preferably one 
which is IsOaeled to facilitate detection, and an assay for 

30 nucleic acid hybridization is conducted under stringent 

conditions which permit hybridization only between the probe and 
highly or perfectly complementary sequences. Because most of the 
PSl and PS2 mutations detected to date consist of a single 
nucleotide substitution, high stringency hybridization conditions 

35 will be required to distinguish normal sequences from most mutant 
sequences. When the presenilin genotypes of the subject's 
parents are known, probes may be chosen accordingly. 
Alternatively, probes to a variety of mutants may be employed 
sequentially or in combination. Because most individuals 

40 carrying presenilin mutants will be heterozygous, probes to 
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normal sequences also may be employed and homozygous normal 
individuals may be distinguished from mutant heterozygotes by the 
amount of binding (e.g., by intensity of radioactive signal). In 
another variation, competitive binding assays may be employed in 
5 which both normal and mutant probes are used but only one is 
labeled. 

(3) ^e?t;y;^<?tAP^ MftPPjlnq 

Sequence alterations may also create or destroy fortuitous 
restriction enzyme recognition sites which are revealed by the 

10 use of appropriate enzyme digestion followed by gel-blot 
hybridization. DMA fragments carrying the site (normal or 
mutant) are detected by their increase or reduction in size, or 
by the increase or decrease of corresponding restriction fragment 
numbers. Such restriction fragment length polymorphism analysis 

15 (RFLP) , or restriction mapping, may be employed with genomic DNA, 
mRNA or cDNA. The presenilin sequences may be amplified by PGR 
using the above -described primers prior to restriction, in which 
case the lengths of the PGR products may indicate the presence or 
absence of particular restriction sites, and/or may be subjected 

20 to restriction after amplification. The presenilin fragments may 
be visualized by any convenient means (e.g., under UV light in 
the presence of ethidixxm bromide) . 

Merely as examples, it is noted that the PSl M146L mutation 
{A-»G at bp 684 of SEQ ID NO: 1} destroys a PsphI site; the H163R 

25 mutation (A-«G at bp 736) destroys an Nlalll site; the A246E 
mutation <G-»A at bp 985) creates a Ddel site; and the L286V 
mutation (G-»G at bp 1104) creates a PvuIII site. One of ordinary 
8)cill in the art may easily choose from the many commercially 
available restriction enzymes and, based upon the normal and 

30 mutsuit sequences disclosed and otherwise enabled herein, perform 
a restriction mapping analysis which will detect virtually any 
presenilin mutation. 

(4) pqR M^ftPiW 

In another series of embodiments, a single base substitution 
35 mutation may be detected based on differential PGR product length 
or production in PGR. Thus, primers which span mutauit sites or 
which,, preferably, have 3' termini at mutation sites, may be 
employed to amplify a sample of genomic DNA, mRNA or cDNA from a 
subject. A mismatch at a mutational site may be expected to 
40 alter the ability of the normal or mutant primers to promote the 



SUBSTITUTE SHEET (RULE 26) 



wo 96/34099 



PCT/CA96/00263 



- 89 - 

polymerase reaction and, thereby, result in product profiles 
which differ between normal subjects and heterozygous and/or 
homozygous presenilin mutants. The PGR products of the normal 
and mutant gene may be differentially separated and detected by 
5 standard techniques, such as polyacrylamide or agarose gel 

electrophoresis and visualization with labeled probes, ethidium 
bromide or the like. Because of possible non-specific priming or 
readthrough of mutation sites, as well as the fact that most 
carriers of mut^ant alleles will be heterozygous, the power of 
10 this technique may be low. 

(5) Electrophoretie Mobility 

Genetic testing based on DNA secpience differences also may 
be achieved by detection of alterations in electrophoretie 
mobility of DNA, mRNA or cDNA fragments in gels. Small sequence 

15 deletions and insertions, for example, can be visualized by high 
resolution gel electrophoresis of single or double stranded DNA, 
or as chauiges in the migration pattern of DNA heteroduplexes in 
non- denaturing gel electrophoresis. Presenilin mutations or 
polymorphisms may also be detected by methods which exploit 

20 mobility shifts due to single- stranded conformational 

polymorphisms (SSCP) associated with mRNA or single-stranded DNA 
secondary structures. 

(6) Chemical Cleavage of Mismatches 

Mutations in the presenilins may also be detected by 

25 employing the chemical cleavage of mismatch (CCM) method (see, 

e.g., Saleeba and Cotton, 1993, and references therein). In this 
technique, probes (up to - l kb) may be mixed with a sample of 
genomic DNA, cDNA or mRNA obtained from a subject. The sample 
and probes are mixed and subjected to conditions which allow for 

30 heteroduplex formation (if any) . Preferably, both the probe and 
sample nucleic acids are double •stramded, or the probe and sample 
may be PGR amplified together, to ensxire creation of all possible 
mismatch heteroduplexes. Mismatched T residues are reactive to 
osmium tetroxide and mismatched C residues are reactive to 

35 hydroxylamine . Because each mismatched A will be accompanied by 
a mismatched T, and each mismatched G will be accompanied by a 
mismatched C, any nucleotide differences between the probe and 
sample (including small insertions or deletions) will lead to the 
formation of at least one reactive heteroduplex. After treatment 

40 with osmium tetroxide and/or hydroxylamine to modify any mismatch 
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sites, the mixture is subjected to chemical cleavage at any 
modified mismatch sites by, for exaunple, reaction with 
piperidine. The mixture may then be analyzed by stsmdard 
techniques such as gel electrophoresis to detect cleavage 
5 products which would indicate mismatches between the probe and 
sample. 

(7) Otl^er MethQ<ag 

Various other methods of detecting presenilin mutations, 
based upon the presenilin sequences disclosed and otherwise 

10 enabled herein, will be apparent to those of ordinary skill in 
the art. Any of these may be employed in accordance with the 
present invention. These include, but are not limited to, 
nuclease protection assays (SI or ligase-mediated) , ligated PCR, 
denaturing gradient gel electrophoresis (DGGE; see, e.g., Fischer 

15 and Lerman, 1983), restriction endonuclease fingerprinting 

combined with SSCP (REF-SSCP; see, e.g., Liu and Sommer, 1995), 
£uid the like. 

D. Other Screens and Diacmostics 

In inherited cases, as the primary event, and in non- 
20 inherited cases as a secondary event due to the disease state, 
abnormal processing of FSl, PS2, APP, or proteins reacting with 
PSl, PS2, or APP may occur. This can be detected as abnormal 
phosphorylation, glycosylation, glycation amidation or 
proteolytic cleavage products in body tissues or fluids (e.g., 
25 CSF or blood) . 

Diagnosis also can be made by observation of alterations in 
presenilin transcription, trauislation, and post-trauislational 
modification and processing as well as alterations in the 
intracellular and extracellular trafficking of presenilin gene 
30 products in the brain and peripheral cells. Such changes will 
include alterations in the amount of presenilin messenger RNA 
and/or protein, alteration in phosphorylation state, abnormal 
intracellular location/distribution, abnormal extracellular 
distribution, etc. Such assays will include: Northern Blots 
35 (with presenilin-specif ic and non-specific nucleotide probes) , 
Western blots and enzyme- linked immunosorbent as8ay& (ELISA) 
(with antibodies raised specifically to a presenilin or 
presenilin functional domain, including various post- 
translational modification states including glycosylated and 
40 ph sphorylated isoforms) . These assays can be performed on 
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peripheral tissues (e.g., blood cells, plasma, cultured or other 
fibroblast tissues, etc.) as well as on biopsies of CNS tissues 
obtained antemortem or postmortem, and upon cerebrospinal fluid. 
Such assays might also include in situ hybridization and 
5 immunohistochemistry (to localize messenger RNA and protein to 
specific subcellular compartments and/or within neuropathological 
structures associated with these diseases such as neurofibrillary 
tcuigles smd amyloid plaques) . 
E. Screening and Diagnostic Kits 

10 In accordance with the present invention, diagnostic kits 

are also provided which will include the reagents necessary for 
the above-described diagnostic screens. For example, kits may be 
provided which include antibodies or sets of antibodies which are 
specific to one or more mutant epitopes. These antibodies may, 

15 in particular, be labeled by any of the standard meeuis which 

facilitate visualization of binding. Alternatively, kits may be 
provided in which oligonucleotide probes or PGR primers, as 
described above, are present for the detection euid/or 
amplification of mutant PSl, PS2 or other presenilin-related 

20 nucleotide sequences. Again, such probes may be labeled for 
easier detection of specific hybridization. As appropriate to 
the various diagnostic embodiments described above, the 
oligonucleotide probes or antibodies in such kits may be 
immobilized to substrates and appropriate controls may be 

25 provided. 

10. Methods of Treatment 

The present invention now provides a basis for therapeutic 
intervention in diseases which are caused, or which may be 
caused, by mutations in the presenilins. As detailed above, 

30 mutations in the hPSl and hPS2 genes have been associated with 
the development of early onset forms of Alzheimer's Disease and, 
therefore, the present invention is particularly directed to the 
treatment of subjects diagnosed with, or at risk of developing, 
Alzheimer's Disease. In view of the expression of the PSl and 

35 PS2 genes in a variety of. tissues, however, it is quite likely 

that the effects of mutations at these loci are not restricted to 
the brain and, therefore, may be causative of disorders in 
addition to Alzheimer's Disease. Therefore, the present 
invention is also directed at diseases manifest in other tissues 

40 which may arise from mutations, mis -express ion, mis-metabolism or 
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other inherited or acquired alterations in the presenilin genes 
and gene products. In addition, although Alzheimer's Disease 
manifests as a neurological disorder, this manifestation may be 
caused by mutations in the presenilins which first affect other 
5 organ tissues (e.g., liver), which then release factors which 
^ affect brain activity, and ultimately cause Alzheimer's Disease. 
Hence, in considering the various therapies described below, it 
is understood that such therapies may be targeted at tissue other 
than the brain, such as heart, placenta, lung, liver, skeletal 
10 muscle, kidney and pancreas, where PSl and/or PS2 are also 
expressed. 

Without being bound to any particular theory of the 
invention, the effect of the Alzheimer's Disease related 
mutations in the presenilins appears to be a gain of a novel 

15 function, or an acceleration of a normal fiinction, which directly 
or indirectly causes aberrant processing of the Amyloid Precursor 
Protein (APP) into A^ peptide, abnormal phosphorylation 
homeostasis, and/or abnormal apoptosis in the brain. Such a gain 
of function or acceleration of function model would be consistent 

20 with the adult onset of the symptoms and the dominant inheritance 
of Alzheimer's Disease. Nonetheless, the mechanism by which 
mutations in the presenilins may cause these effects remains 
\mknown. 

It is known that APP may be metabolized through either of 

25 two pathways. In the first, APP is metabolized by passage 
through the Golgi network and then to secretory pathways via 
clathrin-coated vesicles. Mature APP is then passaged to the 
plasma membrauie where it is cleaved by a-secretase to produce a 
soluble fraction (Protease Nexin II) plus a non-amyloidogenic C- 

30 terminal peptide (Selkoe et al., 1995; Gandy et al., 1993). 
Alternatively, mature APP can be directed to the endosome- 
lysosome pathway where it undergoes 0 and 7-secretase cleavage to 
produce the A^ peptides. The A/3 peptide derivatives of APP are 
neurotoxic (Selkoe et al., 1994). The phosphorylation state of 

35 the cell determines the relative balance between the a- secret ase 
(non-amyloidogenic) or A0 pathways (amyloidogenic pathway) (Gandy 
et al. 1993), and can be modified pharmacologically by phorbol 
esters, muscarinic agonists and other agents. The 
phosphorylation state of the cell appears to be mediated by 

40 cytos lie factors (especially protein kinase C) acting upon one 
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or more integral membrcuie proteins in the Golgi network. 

Without being bound to any particular theory of the 
invention, the presenilins, in particular hPSl or hPS2 (which 
carry several phosphorylation consensus se<iuences for protein 
5 kinase C) , may be the integral membrane proteins whose 

phosphorylation state determines the relative balance between the 
Qf-secretase and pathways. Thus, mutations in the PSl or PS2 
genes may cause alterations in the structure and function of 
their products leading to defective interactions with regulatory 

10 elements (e.g., protein kinase C) or with APP, thereby promoting 
APP to be directed to the amyloidogenic endosome-lysosome 
pathway. Environmental factors (e.g., viruses, toxins, or aging) 
may also have similar effects on PSl or PS2. 

Again without being bound to any particular theory of the 

15 invention, it is also noted that both the PSl and PS2 proteins 
have substantial amino acid sequence homology to htxman ion 
channel proteins and receptors. For instance, the PS2 protein 
shows substantial homology to the human sodium channel a-subunit 
(E-0.18, P-0.16, identities « 22-27% over two regions of at least 

20 35 amino acid residues) using the BLASTP paradigm of Altschul et 
al. (1990). Other diseases (such as malignant hyperthermia and 
hyperkalemic periodic paralysis in hxamans, and the degeneration 
of mechanosensory neurons in C. eleqans ) arise through mutations 
in ion channels or receptor proteins. Mutation of the PSl or PS2 

25 gene could, therefore, eif feet similar functions and lead to 
Alzheimer's Disease and/or other psychiatric and neurological 
diseases. 

Therapies to treat presenilin-associated diseases such as AD 
may be based upon (1) administration of normal PSl or PS2 

30 proteins, (2) gene therapy with normal PSl or PS2 genes to 
compensate for or replace the mutant genes, (3) gene therapy 
based upon antisense sequences to mutant PSl or PS2 genes or 
which "3cnock-out" the mutant genes, (4) gene therapy based upon 
sequences which encode a protein which blocks or corrects the 

35 deleterious effects of PSl or PS2 mutants, (5) immunotherapy 
based upon antibodies to normal and/or mutant PSl or PS2 
proteins, or (S) small molecules (drugs) which alter PSl or PS2 
expression, block abnormal interactions between mutant forms of 
PSl or PS2 and other proteins or ligands, or which otherwise 

40 block the aberrant function of mutant PSl or PS2 proteins by 
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altering the structure of the mutant proteins, by enhancing their 
metabolic clearance, or by inhibiting their function. 

A. Protein Therapy 

Treatment of presenilin-related Alzheimer's Disease, or 
5 other disorders resulting from presenilin mutations, may be 

performed by replacing the mutauit protein with normal protein, by 
modulating the function of the mutant protein, or by providing an 
excess of normal protein to reduce the effect of any aberrant 
fxuiction of the mutant proteins. 

10 To accomplish this, it is necessary to obtain, as described 

and enabled herein, large amounts of substantially pure PSl 
protein or PS2 protein from cultured cell systems which can 
express the protein. Delivery of the protein to the affected 
brain areas or other tissues can then be accomplished using 

15 appropriate packaging or administrating systems including, for 
example,, liposome mediated protein delivery to the target cells. 

B. Gene Theranv 

In one series of embodiments, gene therapy is may be 
employed in which normal copies of the PSl gene or the PS2 gene 

20 are introduced into patients to code successfully for normal 

protein in one or more different affected cell types. The gene 
must be delivered to those cells in a form in which it can be 
taken up and code for sufficient protein to provide effective 
fxuiction. Thus, it is preferred that the recombincuit gene be 

25 operably joined to a strong promote so as to provide a high level 
of expression which will compensate for, or out-compete, the 
mutant proteins. As noted above, the recombinant construct may 
contain endogenous or exogenous regulatory elements, inducible or 
repressible regulatory elements, or tissue-specific regulatory 

30 elements. 

In another series of embodiments, gene therapy may be 
employed to replace the mutant gene by homologous recombination 
with a recombinant construct. The recombinant construct may 
contain a normal copy of the targeted presenilin gene, in which 

35 case the defect is corrected in situ , or may contain a "knock- 
out" construct which introduces a stop codon, missense mutation, 
or deletion which abolished function of the mutant gene. It 
should be noted in this respect that such a construct may knock- 
out both the normal and mutant copies of the targeted presenilin 

40 gene in a heterozygous individual, but the total loss of 
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presenilin gen function may be less deleterious to the 
individual than continued progression of the disease state. 

In another series of embodiments, antisense gene therapy may 
be employed. The antisense therapy is based on the fact that 
5 sequence-specific suppression of gene expression can be achieved 
by intracellular hybridization between mRNA or DNA and a 
complementary antisense species. The formation of a hybrid 
duplex may then interfere with the transcription of the gene 
and/or the processing, transport, translation and/or stability of 

10 the target presenilin mRNA. Antisense strategies may use a 

variety of approaches including the administration of antisense 
oligonucleotides or antisense oligonucleotide cuialogs (e.g., . 
analogs with phosphorothioate backbones) or trcinsf ection with 
antisense RNA expression vectors. Again, such vectors may 

15 include exogenous or endogenous regulatory regions, inducible or 
repressible regulatory elements, or tissue-specific regulatory 
elements . 

In auiother series of embodiments, gene therapy may be used 
to introduce a recombinant construct encoding a protein or 

20 peptide which blocks or otherwise corrects the aberrant function 
caused by a mutsuit presenilin gene. In one embodiment, the 
recombinant gene may encode a peptide which corresponds to a 
mutant domain of a presenilin which has been found to abnormally 
interact with ^mother cell protein or other cell ligand. Thus, 

25 for example, if a muteuit TM6-»7 domain is found to interact with a 
particular cell protein but the corresponding normal TM6-#7 domain 
does not undergo this interaction, gene therapy may be employed 
to provide an excess of the mutant TM6-»7 domain which may compete 
with the mutant protein and inhibit or block the aberrant 

30 interaction. Alternatively, the portion of a protein which 
interacts with a mutant, but not a normal, presenilin may be 
encoded and expressed by a recombinant construct in order to 
con^ete with, and thereby inhibit or block, the aberrant 
interaction. Finally, in another embodiment, the same effect 

35 might be gained by inserting a second mutsuit protein by gene 
therapy in an approach similar to the correction of the "Deg 
1(d)" and "Mec 4(d)" mutations in C. eleaana by insertion of 
mutant trauisgenes. 

Retroviral vectors can be used for somatic cell gene therapy 

40 especially because of their high efficiency of infection and 
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Stable integration and expression. The targeted cells however 
must be able to divide and the expression of the levels of normal 
protein should be high because the disease is a dominant one. 
The full length PSl or PS2 genes, subsequences encoding 
5 functional domains of the presenilins, or any of the other 
therapeutic peptides described above, can be cloned into a 
retroviral vector and driven from its endogenous promoter, from 
the retroviral long terminal repeat, or from a promoter specific 
for the target cell type of interest (e.g., neurons). Other 

10 viral vectors which can be used include adeno-associated virus, 
vaccinia virus, bovine papilloma virus, or a herpes virus such as 
Epstein-Barr virus. 
C. Immunotherapv 

Immunotherapy is also possible for Alzheimer's Disease. 

15 Antibodies are raised to a mutant PSl or PS2 protein (or a 

portion thereof) and are administered to the patient to bind or 
block the mutant protein and prevent its deleterious effects. 
Simultaneously, expression of the normal protein product could be 
encouraged. Alternatively, antibodies are raised to specific 

20 complexes between mutant or wild- type PSl or PS2 and their 
interaction partners. 

A further approach is to stimulate endogenous antibody 
production to the desired antigen. Administration could be in 
the form of a one time immunogenic preparation or vaccine 

25 immiinization. An immwogenic composition may be prepared as 

injectables, as liquid solutions or emulsions. The PSl or PS2 
protein or other antigen may be mixed with pharmaceutically 
acceptable excipients compatible with the protein. Such 
excipients may include water, saline, dextrose, glycerol, ethanol 

30 and combinations thereof. The immunogenic composition and 
vaccine may further contain auxiliary substances such as 
emulsifying agents or adjuvants to enhance effectiveness. 
Immunogenic compositions and vaccines may be administered 
parenteral ly by injection subcutaneously or intramuscularly. 

35 The immunogenic preparations and vaccines are administered 

in such amoiint as will be therapeutically effective, protective 
and immunogenic. Dosage depends on the route of administration 
and will vary according to the size of the host. 
D. Small Molecule Therapeutics 

40 As described and enabled herein, the present invention 
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provides for a. number o£ methods of identifying small molecules 
or other compounds which may be useful in the treatment of 
Alzheimer's Disease or other disorders caused by mutations in the 
presenilins. Thus, for example, the present invention provides 
5 for methods of identifying presenilin binding proteins and, in 
particular, methods for identifying proteins or other cell 
consonants which bind to or otherwise interact with mutant 
presenilins but not with the normal presenilins. The invention 
also provides for methods of identifying small molecules which 

10 caui be used to disrupt aberrant interactions between mutant 
presenilins and such proteins or other cell components. 

Such interactions, involving mutant but not normal 
presenilins, not only provide information useful in understanding 
the biochemical pathways disturbed by mutations in the 

15 presenilins, and causative of Alzheimer's Disease, but also 
provide immediate therapeutic targets for intervention in the 
etiology of the disease. By identifying these proteins and 
analyzing these interactions, it is possible to screen for or 
design compounds which co\interact or prevent the interaction, 

20 thus providing possible treatment for abnormal interactions. 

These treatments would alter the interaction of the presenilins 
with these partners, alter the function of the interacting 
protein, alter the amount or tissue distribution or expression of 
the interaction partners, or alter similar properties of the 

25 presenilins themselves. 

Therapies can be designed to modulate these interactions and 
thus to modulate Alzheimer's Disease and the other conditions 
associated with acquired or inherited abnormalities of the PSl or 
PS2 genes or their gene products. The potential efficacy of 

30 these therapies can be tested by analyzing the affinity and 

f\inction of these interactions after exposure to the therapeutic 
agent by standard pharmacokinetic measxxrements of affinity (Kd 
and Vmax etc.) using synthetic peptides or recombinant proteins 
corresponding to functional domains of the PSl gene, the PS2 gene 

35 or other presenilin homologues. Another method for assaying the 
effect of any interactions involving functional domains such as 
the hydrophilic loop is to monitor changes in the intracellular 
trafficking and post-translational modification of the relevant 
genes by in situ hybridization, immunohistochemistry, Western 

40 blotting and metabolic pulse-chase labeling studies in the 
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presence of, and in the absence of, the therapeutic agents. A 
further method is to monitor the effects of "downstream" events 
including (i) changes in the intracellular metabolism, 
trafficking and targeting of APP and its products; (ii) changes 
5 in second messenger events, e.g., cAMP intracellular Ca^*, protein 
kinase activities, etc. 

As noted above, the presenilins may be involved in APP 
metabolism and the phosphorylation state of the presenilins may 
be critical to the balcuice between the or-secretase and A0 

10 pathways of APP processing. Using the transformed cells and 

animal models of the present invention, one is enabled to better 
understand these pathways and the aberrant events which occur in 
presenilin mutants. Using this knpv^ledge, one may then design 
therapeutic strategies to counteract the deleterious affects of 

15 presenilin mutants. 

To treat Alzheimer's Disease, for example, the* 
phosphorylation state of PSl €uid/or can be altered by chemical 
and biochemical agents (e.g. drugs, peptides and other compoiinds) 
which alter the activity of protein kinase C suxd other protein 

20 kinases, or which alter the activity of protein phosphatases, or 
which modify the availability of PSX to be post-translationally 
modified. The interactions of kinases and phosphatases with the 
presenilin proteins, and the interactions of the presenilin 
proteins with other proteins involved in the trafficking of APP 

25 within the Golgi network, can be modulated to decrease 

trafficking of Golgi vesicles to the endosome-lysosome pathway, 
thereby inhibiting Afi peptide production. Such compounds will 
include peptide analogues of APP, PSl, PS2, and other presenilin 
homologues, as well as other interacting proteins, lipids, 

30 sugars, and agents which promote differential glycosylation of 

PSl, PS2 and/or their homologues; agents which alter the biologic 
half-life of presenilin mRNA or proteins, including antibodies 
and antisense oligonucleotides; and agents which act upon PSl 
and/or PS2 transcription. 

35 The effect of these agents in cell lines and whole animals 

can be monitored by monitoring transcription, translation, and 
post-translational modification of PSl and/ or PS2 (e.g. 
phosphorylation or glycosylation) , as well as intracellular 
trafficking of PSl and/or PS2 through various' intracellular and 

40 extracellular compartments. Methods for these studies include 
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Western and Northern blots, innnunoprecipitation after metabolic 
labelling (pulse-chase) with radio- labelled methionine and ATP, 
and immunohistochemistry. The effect of these agents can also be 
monitored using studies which examine the relative binding 
5 affinities and relative amounts of PSl and/or PS2 proteins 

involved in interactions with protein kinase C and/or APP, using 
either standard binding affinity assays or co-precipitation and 
Western blots using antibodies to protein kinase C, APP, PSl, 
PS2, or other presenilin homologues . The effect of these agents 
10 can also be monitored by assessing the production of A^ peptides 
by ELISA before and after exposure to the putative therapeutic 
agent (see, e.g., Huang et al., 1993). The effect can also be 
monitored by assessing the viability of cell lines after exposure 
to aluminum salts and/or the Aj5 peptides which are thought to be 

15 neurotoxic in Alzheimer's Disease. Finally, the effect of these 
agents can be monitored by assessing the cognitive function of 
animals bearing normal genotypes at APP and/or their presenilin 
homologues, bearing human APP transgenes (with or without 
mutations), bearing human presenilin transgenes (with or without 

20 mutations), or bearing any combination of these. 

similarly, as noted above, the presenilins may be involved 
in the regulation of Ca'* as receptors or ion channels. This role 
of the presenilins also may be explored using the transformed 
cell lines and animal models of the invention. Based upon these 

25 results, a test for Alzheimer's Disease can be produced to detect 
an abnormal receptor or an abnormal ion chamnel function related 
to abnormalities that are acquired or inherited in the presenilin 
genes and their products, or in one of the homologous genes and 
their products. This test can be accomplished either in vivo or 

30 in vitro by measurements of ion channel fluxes and/or 

transmembrane voltage or current fluxes using patch clamp, 
voltage clamp and fluorescent dyes sensitive to intracellular 
calcium or transmembrane voltage. Defective ion channel or 
receptor function can also be assayed by measurements of 

35 activation of second messengers such as cyclic AMP, cGMP tyrosine 
kinases, phosphates, increases in intracellular Ca^* levels, etc. 
Recombinantly made proteins may also be reconstructed in 
artificial membrane systems to study ion channel conductance. 
Therapies which affect Alzheimer's Disease (due to 

40 acquired/ inherited defects in the PSI gene or PS2 gene; due to 
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defects in other pathways leading to this disease such as 
mutations in APP; and due to environmental agents) can be tested 
by analysis of their ability to modify an abnormal ion channel or 
receptor function induced by mutation in a presenilin gene. 
5 Therapies could also be tested by their ability to modify the 
normal function of an ion channel or receptor capacity of the 
presenilin proteins. Such assays can be performed on cultured 
cells expressing endogenous normal or mutant PSi genes/gene 
products or PS2 genes/gene products. Such studies also can be 

10 performed on cells transfected with vectors capable of expressing 
one of the presenilins, or functional domains of one of the 
presenilins, in normal or mutant form. Therapies for Alzheimer's 
Disease can be devised to modify an abnormal ion channel or 
receptor function of the PSl gene or PS2 gene. Such therapies 

15 can be conventional drugs, peptides, sugars, or lipids, as well 
as antibodies or other ligands which affect the properties of the 
PSl or PS2 gene product. Such therapies can also be performed by 
direct replacement of the PSl gene and/or PS2 gene by gene 
therapy. In the case of an ion channel, the gene therapy could 

20 be performed using either mini -genes (cDNA plus a promoter) or 

genomic constructs bearing genomic DNA sequences for parts or all 
of a presenilin gene. Mutant presenilins or homologous gene 
sequences might also be used to counter the effect of the 
inherited or acquired abnormalities of the presenilin genes as 

25 has recently been done for replacement of the Mec 4 and Deg 1 in 
C. eleoans (Huang and Chalfie, 1994). The therapy might also be 
directed at augmenting the receptor or ion channel function of 
one homologue, such as the PS 2 gene, in order that it may 
potentially take over the functions of a mutant form of another 

30 homologue (e.g., a PSl gene rendered defective by acquired or 

inherited defects) . Therapy using auitisense oligonucleotides to 
block the expression of the mutant PSl gene or the mutant PS2 
gene, co-ordinated with gene replacement with normal PSl or PS2 
gene can also be applied using standard techniques of either gene 

35 therapy or protein replacement therapy. 

Example 1, Development of the genetic, physical "contia" and 

t;y?ingcripti9n^; mv Qt t^n? minima; gQ>^gqrq>q^t;4;niq r?qiQn> 

The CEPH MegaYAC and the RPCI PAC human total genomic DNA 
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libraries were searched for clones containing genomic DNA 
fragments from the AD3 region of chromosome 14q24.3 using 
oligonucleotide probes for each of the 12 SSR marker loci used in 
the genetic linkage studies as well as additional markers 
5 (Albertsen et al., 1990; Chumakov et al., 1992; loannu et al., 
1994) * The genetic map distsuices between each marker are 
depicted above the contig, and are derived from published data 
(NIH/CEPH Collaborative Mapping Group, 1992; Wang, 1992; 
Weissenbach et al., 1992; Gyapay et al.. 1994). Clones recovered 

10 for each of the initial marker loci were arranged into an ordered 
series of partially overlapping clones ("contig") using four 
independent methods. First, sequences representing the ends of 
the YAC insert were isolated by inverse PCR (Riley et al . , 1990), 
and hybridized to Southern blot panels containing restriction 

15 digests of DMA from all of the YAC clones recovered for all of 
the initial loci in order to identify other YAC clones bearing 
overlapping sequences. Second, inter-Alu PCR was performed on 
each YAC, and the resultant band patterns were compared across 
the pool of recovered YAC clones in order to identify other 

20 clones bearing overlapping sequences (Bellamne-Chartelot et al., 
1992; Chumakov et al., 1992). Third, to improve the specificity 
of the Alu-PCR fingerprinting, the YAC DNA was restricted with 
Haelll or Rsal, the restriction products were amplified with both 
Alu and LIH consensus primers, and the products were resolved by 

25 polyacrylamide gel electrophoresis. Finally, as additional STSs 
were generated during the search for transcribed sequences, these 
STSs were also used to identify overlaps. The resultant contig 
was complete except for a single discontinuity between YAC932C7 
bearing D14S53 and YAC746B4 containing D14S61. The physical map 

30 order of the STSs within the contig was largely in accordance 
with the genetic linkage map for this region (NIH/CEPH 
Collaborative Mapping Group, 1992; Wang and Weber, 1992; 
Weissenbach et al., 1992; Gyapay et al., 1994). However, as with 
the genetic maps, it was not possible to resolve unambiguously 

35 the relative order of the loci within the D14S43/D14S71 cluster 
and the D14S7&/D14S273 cluster. PACl clones suggested that 
D14S277 is telomeric to D14S26B, whereas genetic maps have 
suggested the reverse order. Furthermore, a few STS probes 
failed to det ct hybridization patterns in at least one YAC clone 

40 which, on the basis of the most parsimonious consensus physical 
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map suid from the genetic map, would have been predicted to 
contain that STS. For instance, the D14S268 (AFM265) and RSCAT7 
STSs are absent from YAC788H12. Because these results were 
reproducible, and occurred with several different STS markers, 
5 these results most likely reflect the presence of small 
interstitial deletions within one of the YAC clones, 

B?^^inPitig Si Cumulative two-P oint lod scores for chromoflome 

14a24.3 markers. 

Genotypes at each polymorphic microsatellite marker locus 
10 were determined by PGR from lOOng of genomic DNA of all available 
affected and iinaf fected pedigree members as previously described 
(St. George-Hyslop et al., 1992) using primer sequences specific 
for each microsatellite locus (Weissenbach et al., 1992; Gyapay 
et al., 1994). The normal population frequency of each allele 
15 was determined using spouses and other neurologically normal 
subjects from the same ethnic groups, but did not differ 
significantly from those established for mixed Caucasian 
populations (Weissenbach et al., 1992; Gyapay et al., 1994). The 
maximum likelihood calculations assumed an age of onset 
20 correction, marker allele frequencies derived from published 
series of mixed Caucasian subjects, and an estimated allele 
frequency for .the AD3 mutation of 1:1000 as previously described 
(St. George-Hyslop et al . , 1992). The analyses were repeated 
using ec[ual marker allele frequencies, and using phenotype 
25 information only from affected pedigree members as previously 

described to ensure that inaccuracies in the estimated parameters 
used in the maximum likelihood calculations did not misdirect the 
analyses (St. George-Hyslop et al. , 1992) These supplemental 
analyses did not significantly alter either the evidence 
30 supporting linkage, or the discovery of recombination events. 
ExanvplQ 3. — HaPlotvpes between flanking markers segregate with 
AD3 in FAD. 

Extended haplotypes between the centromeric and telomeric 
flcuiking markers on the parental copy of chromosome 14 

35 segregating with AD3 in fourteen early onset FAD pedigrees 

(pedigrees NIH2, KGHl, Torl.l, FAD4, FADl, MEXl, and FAD2) show 
pedigree specific lod scores > +3.00 with at least one marker 
between D145258 and D14S53 . Identical partial haplotypes are 
observed in two regions of the disease bearing chromosome 

40 segregating in several pedigrees of similar ethnic origin. In 
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region A, shared alleles are seen at D14S268 ("B": allele size = 
126 bp, allele frequency in normal Caucasians » 0.04; "C**: size » 
124 bp, frequency = 0.38); D14S277 ("B": size = 156 bp, frecjuency 
- 0.19; "C": size ^ 154 bp, frequency « 0.33); and RSCAT6 ("D": 
5 size » lllbp, frequency 0.25; "E": size » 109bp, frequency - 
0.20; "F": size = 107 bp, frequency = 0.47). in region B, 
alleles of identical size are observed at D14S43 ("A**: size » 
193bp, frecjuency = 0.01; "D" : size = 187 bp, frequency = 0.12; 
size a 185 bp, frequency « 0.26; "I": size » 160 bp, 

10 frequency « 0.38); D14S273 ("3": size « 193 bp, frequency « 0.38; 
"4*' size B 191 bp, frequency » 0.16; *'5»: size « 189 bp, 
frequency » 0.34; "6": size = 187 bp,, frequency s 0.02) and 
D14S76 ("1": size « bp, frequency = 0.01; "5": size « bp, 
frequency « 0.38; "6": size = bp, frequency = 0.07; "9": size » 

15 bp, frequency » 0.38). See Sherrington et al. (1995) for 
details. 

Py^TO^e 4. Recovery of transcribed seouenceg from the AD3 

^.^peirva; , 

Putative trauascribed sequences encoded in the AD3 interval 

20 were recovered using a direct hybridization method in which short 
cDNA fragments generated from human brain mRNA were hybridized to 
immobilized cloned genomic DNA fragments (Rommens et al., 1993) . 
The resultant short putatively trauiscribed sequences were used as 
probes to recover longer transcripts from human brain cDNA 

25 libraries (Stratagene, La Jolla) . The physical locations of the 
original short clone and of the subsequently acc[uired longer cDNA 
clones were established by analysis of the hybridization pattern 
generated by hybridizing the probe to Southern blots containing a 
pemel of EcoRI digested total DNA samples isolated from 

30 individual YAC clones within the contig. The nucleotide sequence 
of each of the longer cDNA clones was determined by automated 
cycle sequencing (Applied Biosystems Inc., CA) , and compared to 
other sequences in nucleotide and protein databases using the 
blast algorithm (Altschul et al., 1990). Accession numbers for 

35 the transcribed sec[uences are: L40391, L40392, L40393, L40394, 
L4039S, L40396, L40397, L40398, L40399, L40400, L40401, M0402, 
and L40403. 

Example 5. Locating mutations in the PSl aene using restriction 
enzymes . 

40 The presence of the A246E mutation, which creates a Ddel 
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restriction site, was assayed in genomic DNA by PCR using an end 
IsQseled primer corresponding essentially to bp 907-925 o£ SEQ ID 
NO: 1 and an iinlabelled primer corresponding to the complement of 
bp 1010-990 of SEQ ID NO: 1, to amplify an 84bp genomic exon 
5 fragment using lOOng of genomic DNA template, 2mM MgCl,, lo pMoles 
of each primer, 0.5U Tag polymerase, 250 vH dNTPs for 30 cycles 
of 95 «C X 20 seconds, 60^0 X 20 seconds « 12^C X 5 seconds. The 
products were incubated with an excess of Ddel for 2 hours 
according to the manufacturer's protocol, and the resulting 

10 restriction fragments were resolved on a 6% nondenaturing 
polyacrylamide gel and visualized by autoradiography. The 
presence of the mutation was inferred from the cleavage of the 
84bp fragment to due to the presence of a Ddel restriction site. 
All affected members of the FADl pedigree and several at-risk 

15 members carried the Ddel site. None of the obligate escapees 
(those individuals who do not get the disease, age > 70 years), 
and none of the normal controls carried the Ddel .mutation. 

Example 6. Locating mutations in the PSl aene using allele 

specific oligonucleotides. 

20 The presence of the C410Y mutation was assayed using allele 

specific oligonucleotides. lOOng of genomic DNA was amplified 
with an exonic sequence primer corresponding to bp 1451-1468 of 
SEQ ID NO: 1 and an opposing intronic sequence primer 
complementary to bp 719-699 of SEQ ID NO: 14 using the above 

25 reaction conditions except 2.5 mN MgClj, and cycle conditions of 
94*C X 20 seconds, 58*C X 20 seconds, and 72'C for 10 seconds). 
The resulteuit 2l6bp genomic fragment was denatured by 10-fold 
dilution in 0.4M NaOH, 25 mM EDTA,-and was vacuum slot -blotted to 
duplicate nylon membranes. An end-labeled "wild type" primer 

30 (corresponding to bp 1468-1486 of SEQ ID NO: i) and an end- 
labeled "mutant" primer (corresponding to the same sequence but 
with a G-»A substitution at position 1477) were hybridized to 
separate copies of the slot-blot filters in 5 X SSC, 5 X 
Denhardt's, 0.5% SDS for 1 hour at 48'C, and then washed 

35 successively in 2 X SSC at 23^C and 2 X SSC, 0.1% SDS at 50*C and 
then exposed to X-ray film. All testable affected members as 
well as some at-risk members of the AD3 and NIH2 pedigrees 
possessed the C410Y mutation. Attempts to detect the C410Y 
mutation by SSCP revealed that a common intronic sequence 

40 polymorphism migrated with the same SSCP pattern. 
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Example 7. Northern hybridiz ation demonstrating the eicpression 

of PSl protein mRNA in a variety of tissues. 

Total cytoplasmic RNA was isolated from various tissue 
samples (including heart, brain and different regions of 
5 placenta, lung, liver, skeletal muscle, kidney and pancreas) 
obtained from surgical pathology using standard procedures such 
as CsCl purification. The RNA was then electrophoresed on a 
formaldehyde gel to permit size fractionation. The 
nitrocellulose membrauie was prepared and the RNA was then 

10 transferred onto the membrane. ^^P- labeled cDNA probes were 
prepared and added to the membrane in order for hybridization 
between the probe the RNA to ocpur. After washing, the membrane 
was wrapped in plastic film and placed into imaging cassettes 
containing X-ray film. The autoradiographs were then allowed to 

15 develop for one to several days. Sizing was established by 
comparison to standard RNA markers. Analysis of the 
autoradiographs revealed a prominent band at 3.0 kb in size (see 
Figure 2 of Sherrington et al., 1995). These northern blots 
demonstrated that the PSl gene is expressed in all of the tissues 

20 examined. 

Exaitipjl,^ 9- Eukarvotic and prokarvotic expression vector systems. 

Constructs suitcUt>le for use in eiikaryotic and prokaryotic 
expression systems have been generated using three different 
classes of PSl nucleotide cDNA sequence inserts. In the first 

25 class, termed full-length constructs, the entire PSl cDNA 

sequence is inserted into the expression plasmid in the correct 
orientation, and includes both the natxiral 5' UTR and 3' UTR 
sequences as well as the. entire open reading -frame. The open 
reading frames bear a nucleotide sequence cassette which allows 

30 either the wild type open reading frame to be included in the 
expression system or alternatively, single or a combination of 
double mutations can be inserted into the open reading frame. 
This was accomplished by removing a restriction fragment from the 
wild type open reading frame using the enzymes Narl and Pf Iml and 

35 replacing it with a similar fragment generated by reverse 

transcriptase PGR and bearing the nucleotide sequence encoding 
either the M146L mutation or the H163R mutation. A second 
restriction fragment was removed from the wild type normal 
nucleotide sequence for the open reading frame by cleavage with 

40 the enzymes Pflml and Ncol and replaced with a restriction 



SUBSTmjTE SHEET (RULE 26) 



wo 96/34099 



PCT/CA96/00263 



- 106 - 

fragment bearing the nucleotide sequence encoding the A246E 
mutation, the A260V mutation, the A285V mutation, the L286V 
mutation, the L392V mutation or the C410Y mutation. A third 
varicuat, bearing a combination of either the M146L or H163R 
5 mutation in tauidem with one of the remaining mutations, was made 
by linking a Narl-Pflml fragment bearing one of the former 
mutations and a Pflml-Ncol fragment bearing one of the latter 
mutations . 

The second class of cDNA inserts, termed truncated 

10 constructs, was constructed by removing the 5' UTR and part of 
the 3' UTR sequences from full length wild type or mutant cDNA 
sequences. The 5' UTR sequence was replaced with a synthetic 
oligonucleotide containing a Kpnl restriction site (GGTAC/C) and 
a small sequence (GCCACC) to create a Kozak initiation site 

15 around the ATG at the beginning of the PSl ORF (bp 249-267 of SEQ 
ID NO: 1). The 3' UTR was replaced with an oligonucleotide 
corresponding to the complement of bp 2568-2586 of SEQ ID NO: 1 
with an artificial EcoRI site at the 5' end. Mutant variants of 
this construct were then made by inserting the rautsoit secjuences 

20 described above at the Narl-Pflml and Pslml-Ncol sites as 
described above. 

The third class of constructs included sequences derived 
from clone cc44 in which an alternative splice of Exon 4 results 
in the elimination of four residues in the N- terminus (SEQ ID NO: 

25 3) . 

For eukaryotic expression, these various cDNA constructs 
bearing wild type and mutant sequences, as described above, were 
cloned into the expression vector pZeoSV in which the SV60 
promoter cassette had been removed by restriction digestion cuid 

30 replaced with the CMV promoter element of pcDNA3 (Invitrogen) . 
For prokaryotic expression, constructs have been made using the 
glutathione S- transferase (GST) fusion vector pGEX-kg. The 
inserts which have been attached to the GST fusion nucleotide 
sequence are the same nucleotide sequences described above 

35 bearing either the normal open reading frame nucleotide seG[uence, 
or bearing a combination of single and double mutations as 
described eibove. These GST fusion constructs allow expression of 
the partial or full-length protein in prokaryotic cell systems as 
mutant or wild type GST fusion proteins, thus allowing 

40 purification of the full-length protein followed by removal of 
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the GST fusion product by thrombin digestion. A further cDNA 
construct was made with the GST fusion vector, to allow the 
production of the amino acid sequence corresponding to the 
hydrophilic acidic loop domain between TM6 and TM7 of the full- 
5 length protein, either as a wild type nucleotide sequence or as a 
mutcuit sequence bearing either the A285V mutation, the L286V 
mutation or the L392V mutation. This was accomplished by 
recovering wild type or mutcUit sequence from appropriate sources 
of RNA using a 5' oligonucleotide primer corresponding to bp 

10 1044-1061 of SEQ ID NO: 1 with a 5' BamHI restriction site 

(G/GATCC) , and a 3' primer corresponding to the complement of bp 
1476-1458 oh SEQ ID NO: l.with a 5' EcoRI restriction site 
(G/AATTC) . This allowed cloning of the appropriate mutant or 
wild type nucleotide sequence corresponding to the hydrophilic 

15 acidic loop domain at the BamHI and the EcoRI sites within the 
pGEX-KG vector. 

gX^mp^e ?. — Locating additional mutations in the P51 gene. 

Mutations in the PSl gene can be assayed by a variety of 
strategies (direct nucleotide sequencing, allele specific oligos, 

20 ligation polymerase chain reaction, SSCP, RFLPs) using RT-PCR 
products representing the mature mRNA/cDNA sequence or genomic 
DNA. For the A260V and the A285V mutations, genomic DNA carrying 
the exon can be amplified using the same PGR primers and methods 
as for the L286V mutation. 

25 PGR products were then denatured and slot blotted to 

duplicate nylon membranes using the slot blot protocol described 
for the G410y mutation. 

The A260V mutation was scored on these blots by using 
hybridization with end-labeled allele-specif ic oligonucleotides 

30 corresponding to the wild type sequence (bp 1017-1036 of SEQ ID 
NO: 1) or the mutant sequence (bp 1017-1036 of SEQ ID NO: 1 with 
C-»T at bp 1027) by hybridization at 48*C followed by a wash at 
52*C in 3X SSC buffer containing 0.1% SDS. The A285V mutation was 
scored on these slot blots as described above but using instead 

35 the allele-specif ic oligonucleotides for the wild type sequence 

(bp 1093-1111 of SEQ ID NO: 1) or the mutant primer (bp 1093-1111 
of SEQ ID NO: 1 with C-»T at bp 1102) at 48*C followed by washing 
at 52*C as above except that the wash solution was 2X SSC. 

The L392V mutation was scored by amplification f the exon 

40 from genomic DNA using primers (5' corresponding to bp 439-456 of 
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SEQ ID NO: 14 and 3' complementary to 719-699 of SEQ ID NO: 14) 
using standard PGR buffer conditions except that the magnesium 
concentration was 2mM and cycle conditions were 94*C X lo seconds, 
56'C X 20 seconds, and 72<'C X 10 seconds. The resulting 200 base 
5 pair genomic fragment was denatured as described for the C410Y 
mutation and slot-blotted in duplicate to nylon membranes. The 
presence or absence of the mutation was then scored by 
differential hybridization to either a wild type end- labeled 
oligonucleotide (bp 1413-1431 of SEQ ID NO: i) or with an end- 

10 labeled mutant primer (bp 1413-1431 of SEQ ID NO: l with C-»G at 
bp 1422) by hybridization at 45'C and then successive washing in 
2X SSC at 23'C and then at 68*C. 
Example 10. An tibody production. 

Peptide antigens corresponding to portions of the PSl 

15 protein were synthesized by solid-phase techniques and purified 
by reverse phase high pressure liquid chromatography. Peptides 
were covalently lin)ced to keyhole limpet hemocyanin (KLH) via 
disulfide linkages that were made possible by the addition of a 
cysteine residue at the peptide C- terminus of the presenilin 

20 fragment. This additional residue does not appear normally in 
the protein sequence and was included only to facilitate linkage 
to the KLH molecule. The specific presenilin sequences to which 
antibodies were raised are as follows: 

Polyclonal antibody # hPSl antigen (SEQ ID NO: 2) 

25 1142 30-44 

519 109-123 

520 304-318 
. 1143 _ 346-360 

These sequences are contained within specific domains of the 
30 PSl protein. For example, residues 30-44 are within the N- 

terminus, residues 109-123 are within the TNl-»2 loop, and 

residues 304-318 and 346-360 are within the large TM6-»7 loop. 

Each of these domains is exposed to the aqueous media and may be 

involved in binding to other proteins critical for the 
35 development of the disease phenotype. The choice of peptides was 

based on analysis of the protein sequence using the IBI Pustell 

cuitigenicity prediction algorithm. 

A total of three New Zealand white rabbits were immunized 

with peptide-KLH complexes for each peptide antigen in 
40 combination with Freund's adjuvant and were sxibsequently given 



SUBSTITUTE SHEET (RULE 26) 



wo 96/34099 



PCT/CA96/00263 



- 109 - 

booster injections at seven day intervals. Antisera were 
collected for each peptide and pooled and IgG precipitated with 
ammonium sulfate. Antibodies were then affinity purified with 
Sulfo-link agarose (Pierce) coupled with the appropriate peptide, 
5 This final purification is required to remove non-specific 

interactions of other antibodies present in either the pre- or 
post -immune serum. 

The specificity of each antibody was confirmed by three 
tests. First, each detected single predominant bands of the 

10 approximate size predicted for presenilin-1 on Western blots of 
brain homogenate. Second, each cross-reacted with recombinant 
fusion proteins bearing the appropriate sequence. Third each 
could be specifically blocked by pre -absorption with recombinant 
FSl or the immunizing peptide. 

15 In addition, two dififerent PSl peptide glutathione S- 

transferase (GST) fusion proteins have been used to generate PSl 
antibodies. The first fusion protein included amino acids 1-81 
(N terminus) of PSl fused to GST. The second fusion protein 
included amino acids 266-410 (the TM6-»7 loop domain) of PSl fused 

20 to GST. Constructs encoding these fusion proteins were generated 
by inserting the appropriate nucleotide sequences into pGEX-2T 
expression plasmid (Amrad) . The resulting constructs included 
sequences encoding GST and a site for thrombin sensitive cleavage 
between GST and the PSl peptide. The expression constructs were 

25 trauisfected into DH5a E.coli and expression of the fusion 

proteins was induced using IPTG. The bacterial pellets were 
lysed and the soluble GST- fusion proteins were purified by single 
step affinity chromatography on glutathione sepharose beads 
(Boehringer-Mannheim, Montreal) . The GST-fusion proteins were 

30 used to immxinize mice to generate monoclonal antibodies using 
standard procedures. Clones obtained from these mice were 
screened with purified presenilin fragments. 

In addition, the GST- fusion proteins were cleaved with 
thrombin to release PSl peptide. The released peptides were 

35 purified by size exclusion HPLC and used to immunize rabbits for 
the generation of polyclonal antisera. 

By similar methods, GST fusion proteins were made using 
constsnicts including nucleotide sequences for amino acids 1 to 87 
(N terminus) or 272 to 390 (TM6-»TM7 loop) of presenilin-2 and 

40 employed to generate monoclonal antibodies to that protein. The 
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PS2-GST fusion proteins were also cleaved with thrombin and the 
released, purified peptides used to immunize rabbits to prepare 
polyclonal antisera. 

Exi^mple Identification of mutations in PS2 gen^. 

5 RT-PCR products corresponding to the PS2 ORF were generated 

from RNA of lymphoblasts or frozen post-raortera brain tissue using 
a first oligonucleotide primer pair with the 5' primer 
corresponding to bp 478-496 of SEQ ID NO: 18, and the 3' primer 
complementary to bp 1366-1348 of SEQ ID NO: 18, for a 888 bp 

10 product, and a second primer pair with the 5' primer 

corresponding to bp 1083-1102 of SEQ ID NO: 18, and the 3' primer 
complementary to bp 1909-1892 of SEQ ID NO: 18, for a 826 bp 
product. PGR was performed using 250 mMol dNTPs, 2.5 mM MgCl2, 
10 pMol oligonucleotides in 10 ml cycled for 40 cycles of 94 ©C X 

15 20 seconds, 58»C X 20 seconds, 72 •C X 45 seconds. The PGR 
products were sequenced by automated cycle sequencing (ABI, 
Poster City, CA) and the fluorescent chroraatograms were scanned 
for heterozygous nucleotide substitutions by direct inspection 
and by the Factura (ver 1.2.0) and Sequence Navigator (ver 

20 l.O.lblS) software packages (data not shown). 

Detection of the N141I mutation: The A->T substitution at 
nucleotide 787 creates a Bell restriction site. The exon bearing 
this mutation was amplified from 100 ng of genomic DNA using 
lOpMol each of oligonucleotides corresponding to bp 733-751 of 

25 SEQ ID NO: 18 (end-labeled) and the complement of bp 846-829 of 
SEQ ID NO: 18 (imlabelled) , and PGR reaction conditions similar 
to those described below for the M239V mutation. 2ml of the PGR 
product was restricted with Bell (NEBL, Beverly, MA) in 10 ml 
reaction volume according to the meuiuf acturers' protocol, and the 

30 products were resolved by non- denaturing polyacrylamide gel 

electrophoresis. In subjects with wild type sequences, the 114 
bp PGR product is cleaved into 68 bp and 46 bp fragments. Mutant 
sequences cause the product to be cleaved into 53 bp, 46 bp and 
15 bp. 

35 Detection of the M239V mutation: The A-»G substitution at 

nucleotide 1080 deletes a Nlalll restriction site, allowing the 
presence of the M239V mutation to be detected by amplification 
from 100 ng of genomic DNA using lOpMol each of oligonucleotides 
corresponding to bp 1009-1026 of SEQ ID NO: 18 and the complement 

40 of bp 1118-1101 of SEQ ID NO: 18. PGR conditions were: 0.5 U 
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Taq polymerase, 250 mM dNTPS, ImCi or"P-dCTP, 1.5 mM MgClj, 10 ml 
volume; 30 cycles of 94*C X 30 seconds, 58*C X 20 seconds, 72"C X 
20 seconds, to generate a 110 bp product. 2 ml of the PGR 
reaction were diluted to 10 ml and restricted with 3 U of Nlalll 
5 (NEBL, Beverly, MA) for 3 hours. The restriction products were 
resolved by non- denaturing polyacrylamide gel electrophoresis and 
visualized by autoradiography. Normal subjects show cleavage 
products of 55, 35, 15 and 6 bp, whereas the mutant sequence 
gives fragments of 55, 50 and 6 bp. 

10 Detection of the I420T mutation: Similarly to the 

procedures above, the 142 OT mutation may be screened for by PGR 
amplification of genomic DNA using primers corresponding to bp 
1576-1593 of SEQ ID NO: 18 and the complement of bp 1721-1701 of 
SEQ ID NO: 18 to generate a 146 base pair product. This product 

15 may then be probed with allele specific oligonucleotides for the 
wild-type (e.g., bp 1616-1632 of SEQ ID NO: IB) and mutant (e.g., 
bp 1616-1632 of SEQ ID NO: 18 with a T-^C substitution at bp 1624) 
sequences . 

Example 12. Transgenic mice. 

20 A series of wild type and mutant PSl and PS2 genes were 

constructed for use in the preparation of transgenic mice. 
Mutant versions of PSl and PS2 were generated by site-directed 
mutagenesis of the cloned cDNAs cc33 (PSl) and cc32 (PS2) using 
standard techniques. 

25 cDNAs cc33 and cc32 and their mutant versions were used to 

prepare two classes of mutant and wild type PSl and PS2 cDNAs, as 
described in Example 8. The first class, referred to as "full- 
length" cDNAs, were prepared by removing approximately 200 bp of 
the 3' untranslated region immediately before the poly A site by 

30 digestion with EcoRI (PSl) or PvuII (PS2) . The second class, 

referred to as "truncated" cDNAs, were prepared by replacing the 
5' untranslated region with a ribosome binding site (Kozak 
consensus sequence) placed immediately 5' of the ATG start codon. 
Various full length and truncated wild type and mutant PSl 

35 and PS2 cDNAs, prepared as described above, were introduced into 
one or more of the following vectors and the resulting constructs 
were used as a soiirce of gene for the production of transgenic 
mice. 

The cos.TET expression vector : This vector was derived from 
40 a cosmid clone containing the Syrieui hamster PrP gene. It has 
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been described in detail by Scott et al. (1992) and Hsiao et al. 
(1995) . PSl and PS2 cDNAs (full length or truncated) were 
inserted into this vector at its Sail site. The final constructs 
contain 20 kb of 5' sequence flanking the inserted cDNA. This 5' 
5 flanking sequence includes the PrP gene promoter, 50 bp of a PrP 
gene 5' untranslated region exon, a splice donor site, a 1 kb 
intron, and a splice acceptor site located immediately adjacent 
to the Sail site into which the PSl or PS2 cDNA was inserted. 
The 3' sequence flanking the inserted cDNA includes an 
10 approximately 8 kb segment of PrP 3' untranslated region 

including a polyadenylat ion signal. Digestion of this construct 
with NotI (PSl) or Fsel (PS2) released a fragment containing a 
mutant or wild type PS gene under the control of the PrP 
promoter. The released fragment was gel purified and injected 
IS into the pronuclei of fertilized mouse eggs using the method of 
Hsiao et al. (1995) . 

P3.^tgXg<:-dgyivfid growth factor receptor fl-aiihtir^^^ 
<?pngtr^?^S: PS CDNAs were also introduced between the Sail (full 
length PSl cDNAs) or Hindlll (truncated PSl cDNAs, full length 
20 PS2 cDNAs, and truncated PS2 cDNAs) at the 3' end of the human 
platelet derived growth factor receptor /5-subunit promoter and 
the EcoRI site at the 5' end of the SV40 poly A sequence and the 
entire cassette was cloned into the pZeoSV vector (Invitrogen, 
San Diego, CA.). Fragments released by Scal/BamHI digestion were 
25 gel purified and injected into the pronuclei of fertilized mouse 
eggs using the method of Hsiao et al. (1995) . 

Human g-actin constructs ■ PSl and PS2 cDNAs were inserted 
into the Sail site of pBAcGH. The construct produced by this 
insertion includes 3.4 kb of the human p actin 5' flanking 
30 sequence (the human actin promoter, a spliced 78 bp hiiman 

actin 5' \intranslated exon and intron) and the PSl or PS2 insert 
followed by 2.2 kb of human growth hormone genomic sequence 
containing several introns and exons as well as a polyadenylation 
signal. Sfil was used to release a PS-containing fragment which 
35 was gel purified and injected into the pronuclei of fertilized 
mouse eggs using the method of Hsiao et al. (1995). 

PhosDhoQlvcerate kinase cQnstmefcfl. psi and PS2 cDNAs were 
introduced into the p)cJ90 vector. The cDNAs were inserted 
between the Kpnl site downstream of the human phosphoglycerate 
40 kinase promoter and the Xbal site upstream of the 3' untranslated 
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region of the human phosphoglycerate kinase gene. PvuII/Hindlll 
{PSl cDNAs) or PvuII (PS2 cDNAs) digestion was used to release a 
PS-containing fragment which was then gel purified and injected 
into the pronuclei of fertilized mouse eggs as described above. 
5 B?i;^mpA^ — Expression of recombinant PSl and PS2 in eukarvof 

Recombinant PSl and PS2 have been expressed in a variety of 
cell types (e.g. PC12, neuroblastoma, Chinese hamster ovary, and 
human embryonic kidney 293 cells) using the pcDNA3 vector 

10 (Invitrogen, San Diego, CA.). The PSl and PS2 cONAs inserted 
into this vector were the same full length and truncated cDNAs 
described in Example 8 . 

These cDNAs were inserted between the CMV promoter and the 
bovine growth hormone polyadenylation site of pcDNA3 . The 

15 transgenes were expressed at high levels. 

In addition, PSl and PS2 have been expressed in COS cells 
using the pCMX vector. To facilitate tagging and tracing of the 
intracellular localization of the presenilin proteins, 
oligonucleotides encoding a sequence of 11 amino acids derived 

20 from the human c*myc antigen (see, e.g., Evan et al., 1985) and 
recognized by the monoclonal anti-myc antibody MYC 1-9E10.2 
(Product CRL 1729, ATCC, Rockville, Md.) were ligated in-frame 
either immediately in front of or immediately behind the open 
reading frame of PSl and PS2 cDNAs. Untagged pCMX constructs 

25 were also prepared. The c-myc-tagged constructs were also 
introduced into pcDNA3 for transfection into CHO cells. 

Transient and stable transfection of these constructs has 
. been achieved using Lipof ectamine .(Gibco/BRL)_. according to the 
manufacturer's protocols. Cultures were assayed for transient 

30 expression after 48 hours. Stably transfected lines were 
selected using 0.5 mg/ml Geneticin (Gibco/BRL) . 

Expression of transfected PS proteins was assayed by Western 
blot using the ant i -presenilin antibodies 1142, 519 and 520 
described above. Briefly, cultured transfected cells were 

35 soliibilized (2% SDS, 5 mM EDTA, 1 mg/ml leupeptin and aprotinin) , 
and the protein concentration was determined by Lowry. Proteins 
were separated on SDS-PAGE gradient gels (4-20% Novex) and 
transferred to PVDF (10 mM CAPS) for 2 hr at a constauit voltage 
(50V) . Non-specific binding was blocked with skim milk (5%) for 

40 1 hr. The proteins were then probed with the two rabbit 
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polyclonal antibodies (-Img/ml in TBS, pH 7.4) for 12 hrs, 
Presenilin cross -reactive species were identified using 
biotinylated goat-cuiti rabbit secondary antibody which was 
visualized using horseradish peroxidase -conjugated strepavadin 
5 tertiary, 4-chloro-napthol, and hydrogen peroxide. The c-rayc- 
tagged presenilin peptides were assayed by Western blotting using 
both the ant i -presenilin antibodies described above (to detect 
the presenilin peptide antigen) , and culture supernatant from the 
hybridoma MYC 1-9E10.2 diluted 1:10 for Western blots and 1:3 for 

10 iimnunocytochemistry (to detect the myc-epitope) . A major band of 
imniunoreactivity of 50-60 kDa was identified by each of the 
various presenilin antibodies, and by the myc-epitope antibodies 
(for cell lines transfected with myc- containing plasraids) . Minor 
bands at -10-19 )cDa and at -70)cDa were detected by some 

15 presenilin antibodies. 

For immunocytochemistry, transfected cells were fixed with 
4% formaldehyde in Tris buffered saline (TBS) , washed extensively 
with TBS plus 0.1% Triton and non-specific binding blocked with 
3% BSA. Fixed cells were probed with the presenilin antibodies 

20 (e.g., antibodies 520 and 1142, above; typically 5-10 mg/ml) , 
washed and visualized with FITC- or rhodamine- conjugated goat- 
anti rabbit secondary antibody. For c-myc-tagged presenilin 
constructs, the hybridoma MYC 1-9E10.2 supernatant diluted 1:3 
was used with anti-mouse secondary antibody. Slides were moxin ted 

25 in 90% glycerol with 0.1% phenylenediamine (ICN) to preserve 

fluorescence. Anti-BIP (or anti-calnexin) (StressGen, Victoria, 
B.C.) and wheat germ agglutinin (BY Labs, San Mateo, CA) were 
used as markers of endoplasmic reticulum and -Golgi respectively. 
Double-immuno- labeling was also performed with anti-actin (Sigma, 

30 St. Louis, Mo.), anti-amyloid precursor protein (22C11, 

Boehringer Mannheim) and anti-neurof ilament (NF-M specific, 
Sigma) in neuronal line NSC34. These immunofluorescence studies 
demonstrated that the treuisfection product is widely distributed 
within the cell, with a particularly intense perinuclear 

35 localization suggestive of the endoplasmic reticulum and the 
Golgi apparatus, which is similar to that observed in 
wtransf acted cells but is more intense, sometimes spilling over 
into the nuclear membrane. Co-immunolocalization of the c-myc 
and PS epitopes was observed in CHO and COS cells transiently 

40 transfected with the myc-tagged presenilin constructs. 
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Robust expression of the transfected presenilin gene in the 
transfected cells was thus proven by itnmunocytochemistry, 
Northern blot, Western blots (using antibodies to presenilins as 
above, and using the monoclonal antibody MYC 1-9E10.2 to the myc 
5 tag in constructs with 3' or 5' c-itiyc tags). 

Py^tnp?.e 3,^T Isolation of pre senilin binding proteins bv affinity 

fffryonic^tpqygtphY. 

To identify the proteins which may be involved in the 
biochemical fiinction of the presenilins, PSl-binding proteins 
10 were isolated using affinity chromatography. A GST-fusion 

protein containing the PSl TM6-*7 loop, prepared as described in 
Example B, was used to probe human brain extracts, prepared by 
homogenizing brain tissue by Polytron in physiological salt 
solution. Non-specific binding was eliminated by pre-clearing 
15 the brain homogenates of endogenous GST-binding components by 
incxibation with glutathione-Sepharose beads. These GST- free 
homogenates were then incxibated with the GST- PS fusion proteins 
to produce, the desired complexes with functional binding 
proteins. These complexes were then recovered using the affinity 
20 glutathione-Sepharose beads. After extensive washing with 

phosphate buffered saline, the isolated collection of proteins 
was separated by SDS-polyacrylamide gel electrophoresis (SDS- 
PAGE; Tris-tricine gradient gel 4-20%) . Two major bands were 
observed at -14 and 20 kD in addition to several weaker bands 
25 ranging from 50 to 60 kD. 

Pharmacologic modification of interaction between these 
proteins and the TM6-»7 loop may be employed in the treatment of 
. Alzheimer's Disease In addition, these_proteins which are 
likely to act within the presenilin biochemical pathway may be 
30 novel sites of mutations that cause Alzheimer's Disease. 

gj^amp?,^ J|?T Isolation of pres enilin binding proteins by two- 

hvbrid veast system. 

To identify proteins interacting with the presenilin 
proteins, a yeast expression plasmid vector (pAS2-l, Clontech) 
35 was generated by ligating an in- frame partial cDNA sequence 

encoding either residues 266-409 of the PSl protein or residues 
272-390 of the PS2 protein into the EcoRI and BamHI sites of the 
vector. The resultant fusion protein contains the GAL4 DNA 
binding domain coupled in- frame either to the TM6->7 loop of the 
40 PSl protein or to the TM6-»7 loop of the PS2 protein. These 
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expression plasmids were co- transformed, along with purified 
plasmid DNA from the human brain cDNAipACT library, into yeast 
using the protocols of the Clontech Matchmaker yeast- two-hybrid 
kit (Clontech) . Yeast clones bearing humsui brain cDNAs which 
5 interact with the TM6-»7 loop domain were selected by HIS 
resistwce and activation. The clones were further 

selected by cyclohexamide sensitivity and the inserts of the 
human brain cDNAs were isolated by PGR and sequenced. Of 6 
million initial trsuisformants , 200 positive clones were obtained 

10 after HIS selection, and 42 after Pgal-^ color selection, carried 
out in accordance with the manufacturer's protocol for selection 
of positive colonies. Of these 42 clones there were several (5- 
8) independent clones representing the same genes. This 
indicates that these interactions are biologically real and 

15 reproducible. 

Example 16. Tr^nsq^ni,? ffj,^qfflg- 

Transgenic C. eleoans were obtained by microinjection of 
oocytes. The vectors pPD49.3 hsp 16-41 and pPD49.78 hsp 16-2 
were chosen for this purpose. Using the first of these vectors, 

20 transgenic C. eleoans were produced in which a normal hPSl gene 
or a mutant (L392V) was introduced. Transformed animals were 
detected by assaying expression of human cDNA on northern blots 
or western blots using human cDNA probe cc32 and antibodies 519, 
520 and 1142, described ctbove. Vectors were also prepared and/or 

25 injected bearing a cis double mutant hPSl gene (M146L and L392V) , 
a normal hPS2 gene, and a mutant (N141I) hPS2 gene. 
Example 17, Cloning of a Drosonhila presenilin homolooue. DmPS- 

Redundant oligonucleotides- ^' -ctn.ccn gar_ tgg_.acn gyc tgg 
{SEQ ID NO: 22) and 5' rca ngc (agt)at ngt ngt rtt cca (SEQ ID 

30 NO: 23) were designed from published nucleotide sequence data for 
highly conserved regions of the presenilin/6el-l2 proteins 
ending/begixming with Trp (e.g., at residues Trp247 and Trp404 in 
PSl; Trp253 amd Trp385 in PS2) . These primers were used for RT- 
PCR (50ml volume, 2mM MgCla, 30 cycles of 94*C x 30", 57'C x 20", 

35 72*C X 20") from mRNA from adult and embryonic D. melanoaaster . 
The products were then reamplified using cycle conditions of 94*C 
X 1', S9*C X 0.5' and 72*C x 1' and internal conserved redundant 
primer 5' ttt ttt etc gag acn gen car gar aga aay ga (SEQ ID NO: 
24) and 5' ttt ttt gga tec tar aa(agt) atr aar ten cc (SEQ ID NO: 

40 25) . The -^eoo bp product was cloned into the BamHI and Xhol 
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sites of pBS. These products were sequenced and shown to contain 
em. open reading frame with a putative amino acid sequence highly 
homologous to that of the human presenilins. This fragment was 
then used to screen a conventional D. melanoaaster cDNA/Zap 
5 library {Stratagene, CA) to recover six independent cDNA clones 
of size - 2-2.5 kb (clones pds8, pdslS, pdsl, pds3, pds7 and 
pdsl4) which were sequenced. The longest ORF encodes a 
polypeptide of 541 amino acids with 52% identity to the hiiman 
presenilins. 

10 ^ftTIIP^e 3t8- Assays for long isoforms of the AB peptiides, 

AS peptides were extracted with 99% formic acid for 60 
minutes (20 •C) from frozen cerebral cortex of histopathologically 
confirmed cases of FAD with PSl or fiAPP^^^ mutations; sporadic AD 
with no known family history of the disease; other adult onset 
15 neurodegenerative, disorders (HD « Huntington Disease ; 

ALS s amyotrophic lateral sclerosis); Down's Syndrome (DS) ; and 
control subjects without neurologic symptoms. After 
cent rifugat ion at 200,000 X g for 20 minutes, the supernatant was 
separated from the pellet, diluted, neutralized and examined by 
20 ELI5A. To quantitate different species of AS, four monoclonal 
antibodies were used. Antibody BNT-77 (which detects epitopes 
from the center of Afi) and antibody BAN-50 (which detects 
N- terminal residues) were used first to bind all types of AS 
including heterologous forms with or without N-terminal 
25 truncation (BNT-77) or only without N-terminal truncation 
(BAN-50) . Two additional monoclonal antibodies, which 
specifically detect either short-tailed AS ending at residue 40 
(antibody BA-27)_ or long^ tailed _AS ending, at residues 42/43 
(antibody BC-05) , were then used to distinguish the different 
30 terminal forms of AS. Two site ELISA was carried out as 

described previously (Tamaoka et al., 1994; Suzuki et al., 1994). 
Briefly, 100 fig of standard peptides or the supematants from 
brain tissue were applied onto microplates coated with the BNT-77 
antibody, incubated at 4''C for 24 hours, washed with phosphate- 
35 buffered saline, and then incxibated with HRP- labeled BA-27 and 
BC-05 antibodies at 4*C for 24 hours. HRP activities were 
assayed by color development using the TNB microwell peroxidase 
system as previously described. Cortical AS levels were compared 
between diagnostic groups using paired Student-t tests. Joint 
40 evaluation of all the AS isoform data, using the Student -Newman - 
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Keuls multiple comparison of means test, revealed that AlSl-42 
levels from SAPP,!, and sporadic AD subjects were distinct from 
those for PSl mutation cases, but similar to controls. In 
contrast, three group were distinguishable when levels 
5 were considered: high (PSl and fiAPP^i, AD), medium (sporadic AD) 
and low (control) . 

Specifically, measurement of the concentrations of the 
various isoforms in the cerebral cortex of 14 control 
subjects, including five subjects with other neurodegenerative 
10 diseases with onset in the fourth and fifth decades of life, 

revealed only low concentrations of both short- tailed Ap (A/51-40: 
0.06 ± 0.02 nMol /gram wet tissue ± SEM; A/?x-40: .0.17 t 0,40) and, 
long-tailed A^ (A/Jl-42/43: 0.35 ± 0.17; A5X-42/43: 1.17 ± 
0.80). In contrast, the long-tailed A^ peptides were 
15 significantly elevated in the cerebral cortex of all four 

subjects with PSl mutations (Atfl-42/43: $.54 ± 2.0, p = 0.05; 
A^X'-42/43: 23.91 ± 4.00, p < 0.01). Similar increases in the 
concentration of long-tailed A/3 peptides were detected in the 
cortex of both subjects with ^APP,i, mutations {A/Sl-42/43: 2.03 ± 
20 . 1.04; A/8X-42/43: 25.15 ± 5.74), and subjects with sporadic AD 

{A/51-42/43: 1.21 ± 0.40, p » 0.008; Ai9x-42/43: 14.45 ± 2.81, p 
» 0.001). In subjects with PSl or /SAPP^^, mutations, this 
increase in long-tailed isoforms of A0 was accompanied by a small 
but non- significant increase in short-tailed A/5 isoforms (e.g., 
25 A/5X-40: 3.08 ± 1.31 in PSl mutants; 1.56 ± 0.07 in i5APP,„ 

mutants) . Thus, the ratio of long to short isoforms was also 
significantly increased. However, in the sporadic AD cases, the 
observed increase in long- tailed A/5 was accompanied typically by 
a much larger increase in short-tailed A^ isoforms (A^l-40: 3.92 
30 ± 1.42; A/Jx-40: 16.60 ± 5.88). This increase in short-tailed A/5 
was statistically significant when compared to controls (p < 0.03 
for both APl-40 and A^x-40) , but was of borderline statistical 
significance when compared to the PSl and fiA^Vj^^ cases (p _ 
0.05). Analysis of cortical samples from an adult subject with 
35 Down's syndrome revealed a pattern similar to that observed in 
sporadic AD. 

Although preferred embodiments of the invention have been 
described herein in detail, it will be understood by those 
skilled in the art that variations may be made thereto without 
40 departing from the spirit of the invention or the scope of the 
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appended claims. 

TABLE 1 



ELEMENT 


POSITION 


ELEMENT 




POSITION 


STATl (GAS) 


34-46 
278-286 


611-619 
631-639 


CAT box 






895-900 
975-982 




431-439 


1582-1590 


TATA box 






925-933 




443-451 


1965-1973 








978-988 




495-503 


2125-2133 


TPIID 






578-581 




533-541 










982-985 


STAT3 


36-43 
124-131 


737-744 
811-898 


TRXN (CAP) 
start 






1002-1007 
1038-1043 




429-436 


1063-1070 


GC box 
(SPl) 






1453-1460 




496-503 


1686-1693 








1454-1452 




533-540 


1966-1973 


AP2, AP2-Uke 


numeroua 


occurrences 




537-544 


2104-2111 






throughout sequence 




632-639 


2407-2414 


NFIL6 


€XX 


-620 


1567-1576 


MEDl^MEDl-lUce 


1121-1126 


1235-1240 




890 


-899 


1945-1954 




1126-1131 


1716-1721 




1062- 


1071 
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TABLE 2 





PSl Domain 


ADuroximate Position 




N- terminus 


1-81 




TMl 


82-100 


5 


TMl-»2 


101-132 




TM2 


133-154 




TM2-»3 


155-163 




TM3 


164-183 




TM3-^ 


184-194 


10 


TM4 


195-212 




TM4-*5 


213-220 




TM5 


221-238 




TM5-»6 


239-243 




TM6 


244-262 


15 


TM6-^7 


263-407 




TM7 


408-428 




C- terminus 


429-467 
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TABLE 3 



PS2 Domain 


ADoroximate Position 


N - t e rminus 


1-87 


TMl 


88-106 


TMl-»2 


107-134 


TM2 


135-160 


TM2-+3 


161-169 


TM3 


170-189 


TM3-»4 


190-200 


TM4 


201-218 


TM4-#5 


219-224 


TM5 


225-244 


TM5«*6 


245-249 


TM6 


250-268 


TM6-»7 


269-387 


TM7 


388-409 


C- terminus 


410-448 
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TABLE 4 





Position in 


Nucleotide 


Amino Acid 


Functional 


Age of 




SEQ ID N0:1 


Change 


Change 


Domain 


Onset of 












FAD 


1. 


NA 


NA 


A79? 


N- terminus 


64 


2. 


492 


G^C 


V82L 


TMl 


55 


3. 


NA 


NA 


V96F 


TMl 


NA 


45 


591 


T-*C 


Y115H 


TMl-*2 


37 


5. 


664 


T-»C 


M139T 


TM2 


49 


6. 


NA 


NA 


M139V 


TM2 


40 


7. 


676 


T-»C 


I143T 


TM2 


35 


8. 


684 


A-^ 


M146L 


TM2 


45 


90 


NA 


NA 


M146V 


TM2 


38 


10. 


736 


AhO 


H163R 


TM2-»3 


50 


11. 


NA 


NA 


H163Y 


TM2-»3 


47 


12. 


NA 


NA 


L171P 


TM3 


35 


13- 


NA 


NA 


G209V 


TM4 


NA 




NA 


NA 


I211T 


TM4 


NA 


15. 


939 


G-»A 


A231T 


JMS 


52 


16. 


985 


C-»A 


A246E 


TM6 


55 


17. 


1027 


C-*T 


A260V 


TM6 


40 


18. 


NA 


NA 


C263R 


TM6-»7 


47 


ISO. 


1039 


C-»T 


P264L 


TM6-#7 


45 


20. 


NA 


NA 


P267S 


TM6-*7 


35 


21. 


NA 


NA 


E280A 


TM6-»7 


47 


22. 


NA 


NA 


E280G 


TM6-^7 


42 


23. 


1102 


C-»T 


A28SV 


TM6-»7 


50 


29£. 


1104 


C-»G 


L286V 


TM6-»7 


50 


25. 


NA 


deletion 


A291-319 


TM6-»7 


NA 


26. 


1399 




G384A 


TM6-»7 


35 


27. 


1422 




L392V 


TM6-#7 


25-40 


28. 


1477 


G-»A 


C410Y 


TM7 


48 
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TABLE 5 



1. 
2. 
3. 



Position in 
SEQ ID NO: IB 
787 

1080 

1624 



Nucleotide 
Change 

Ar»6 

T-*C 



Amino Acid 
Change 
N141I 
M239V 
I420T 



Functional 
Domain 

TM2 

TM5 
C- terminus 



Age of 
Onset of FAD 
50-65 
50-70 
45 



28-61 
65-71 
109-112 
120-122 
218-221 
241-243 
267-269 



TABLE 6 

302-310 
311-325 
332-342 
346-359 
372-382 
400-410 



TABLE 7 



25-45 
50-63 
70-75 
114-120 
127-132 
162-167 
221-226 



262-290 
310-314 
321-338 
345-352 
380-390 
430-435 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 
(i) APPLICANT: 

(A) NAME: KSC RESEARCH AND DEVELOPMENT LIMITED 

PARTNERSHIP 

(B) STREET: S55 University Avenue 

(C) CITY; Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) POSTAL CODE (ZIP) : MSG 1X8 

(G) TELEPHONE: (416) B13-5982 

(H) TELEFAX: (416) 813-5085 

(A) NAME: THE GOVERNING COUNCIL OF THE UNIVERSITY OF 

TORONTO 

(B) STREET: 106 » Simcoe Hall, 27 King's College Circle 
CO CITY: Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) POSTAL CODE (ZIP) : M5S lAl 
(6) TELEPHONE: (416) 978-7461 
(H) TELEFAX: (416) 978-1878 

(A) NAME: ST. GEORGE -HYS LOP, Peter K. 

(B) STREET: 210 Richvietr Avenue 

(C) CITY: Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) POSTAL CODE (ZIP) : MSP 3G3 

(A) NAME: FRASER, Paul E. 

(B) STREET: 611 Hindemere Avenue 

(C) CITY: Toronto 

(D) STATE: Ontario 
(£) COUNTRY: Canada 

(F) POSTAL CODE (ZIP) : M6S 3L9 

(A) NAME: ROMMENS. Johanna N. 

(B) STREET: 105 McCaul Street 

(C) CITY: Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) POSTAL CODE (ZIP) : MST 2XT 

(ii) TITLE OF INVENTION: GENETIC SEQUENCES AND PROTEINS 

RELATED TO ALZHEIMER'S DISEASE, 
AND USES THEREFOR 

(iii) NUMBER OP SEQUENCES: 25 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sim & Mcfiumey 

(B) STREET: 330 University Avenue, 6th Floor 

(C) CITY: Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) ZIP: MSG 1R7 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1«0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/CA96/00263 

(B) FILING DATE: April 29. 1996 

(C) CLASSIFICATION: 
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(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/509,359 

(B) FILING DATE: 31-JUL-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/496,841 

(B) FILING DATE: 28-JUN-199S 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/431,048 

(B) FILING DATE: 28-APR-1995 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: RAE, Patricia A. 

(C) REFERENCE/DOCKET NUMBER: 7425-16 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (416) S9S-11S5 

(B) TELEFAX: (416) 595-1163 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2765 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: COS 

(B) LOCATION: 249. .1649 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1 . , 2675 

(D) OTHER INFORMATION: /note- "hPSl-l" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

TGGOACAGGC AGCTCCGGGG TCCGCGGTTT CACATCGGAA ACAAAACAGC GGCTGGTCTG 60 

GAAG6AACCT GAGCTACGAG CCGCGGCGGC AGCGGGGCGG CGGGGAAGCG TATACCTAAT 120 

CTGGGAGCCT GCAAGTGACA ACAGCCTTTG CGGTCCTTTAG ACAGCTTGGC CTGGAGGAGA 180 

ACACAT6AAA GAAAGAACCTT CAAGAGGCHT TGTTTTCTGT GAAACAGTAT TTCTATACAG 240 

TTGCTCCA ATG ACA GAG TTA CCT GCA CCG TTG TCC TAC TTC CAG AAT GCA 290 
Met Thr Glu Leu Pro Ala Pro Leu Ser Tyr Phe Gin Asn Ala 
1 5 10 

CAG ATG TCT GAG GAC AAC CAC CTG AGC AAT ACT GTA CGT AGC CAG AAT 338 
Gin Met Ser Glu Asp Asn His Leu Ser Asn Thr Val Arg Ser Gin Asn 
15 20 25 30 

GAC AAT AGA GAA CGG CAG GAG CAC AAC GAC AGA C6G AGC CTT GGC CAC 386 
Asp Asn Arg Glu Arg Gin Glu His Asn Asp Arg Arg Ser Leu Gly His 
35 40 45 

CCT GAG CCA TTA TCT AAT GGA CGA CCC CAG GGT AAC TCC CGG CAG GTG 434 
Pro Glu Pro Leu Ser Asn Gly Arg Pro Gin Gly Asn Ser Arg Gin Val 
50 55 60 

GTG GAG CAA GAT GAG GAA GAA GAT GAG GAG CTG ACA TTG AAA TAT GGC 482 
Val Glu Gin Asp Glu Glu Glu Asp Glu Glu Leu Thr Leu Lys Tyr Gly 
65 70 75 

GCC AAG CAT GTG ATC ATG CTC TTT GTC CCT GTG ACT CTC TGC ATG GTG 530 
Ala Lys His Val He Met Leu Phe Val Pro Val Thr Leu Cys Met Val 
60 85 90 
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GTG GTC GTO GCT ACC ATT AAG TCA GTC AGC TTT TAT ACC CGG AAG GAT 578 
Val Val Val Ala Thr lie Lys Ser Val Ser Phe Tyr Thr Arg Lys Asp 
95 100 105 110 

GGG CAG CTA ATC TAT ACC CCA TTC ACA GAA GAT ACC GAG ACT GTG GGC €26 
Gly Gin Leu He Tyr Thr Pro Phe Thr GXu Asp Thr Glu Thr Val Gly 
115 120 125 

CAG AGA GCC CTG CAC TCA ATT CTG AAT GCT GCC ATC ATG ATC ACT GTC. 674 
Gin Arg Ala Leu His Ser He Leu Asn Ala Ala He Met He Ser Val 
130 135 140 

ATT GTT GTC ATG ACT ATC CTC CTG GTG GTT CTG TAT AAA TAC AGG TGC 722 
He Val Val Met Thr He Leu Leu Val Val Leu Tyr Lys Tyr Arg Cys 
145 150 155 

TAT AAG GTC ATC CAT GCC TGG CTT ATT ATA TCA TCT CTA TTG TTG CTG 770 
Tyr Lys Val He His Ala Trp Leu He He Ser Ser Leu Leu Leu Leu 
160 165 170 

TTC TTT TTT TCA TTC ATT TAC TTG GGG GAA GTG TTT AAA ACC TAT AAC 818 
Phe Phe Phe Ser Phe He Tyr Leu Gly Glu Val Phe Lys Thr Tyr Asn 
175 180 185 190 

GTT GCT GTG GAC TAC ATT ACT GTT GCA CTC CTG ATC TGG AAT TTT GGT 866 
Val Ala Val Asp Tyr He Thr Val Ala Leu Leu He Trp Asn Phe Gly 
195 200 205 

GTO GTO GGA ATG ATT TCC ATT CAC TGG AAA GGT CCA CTT CGA CTC CAG 914 
Val Val Gly Met He Ser He His Trp Lys Gly Pro Leu Arg Leu Gin 
210 215 220 

CAG GCA TAT CTC ATT ATG ATT AGT GCC CTC ATG GCC CTG GTG TTT ATC 962 
Gin Ala Tyr Leu He Met He Ser Ala Leu Met Ala Leu Val Phe He 
225 230 235 

AAG TAC CTC CCT GAA TGG ACT GCG TGG CTC ATC TTG GCT GTG ATT TCA 1010 
Lys Tyr Leu Pro Glu Trp Thr Ala Trp Leu He Leu Ala Val He Ser 
240 245 250 

GTA TAT GAT TTA GTG GCT GTT TTG TGT CCG AAA GGT CCA CTT CGT ATG 1058 
Val Tyr Asp Leu Val Ala Val Leu Cys Pro Lys Gly Pro Leu Arg Met 
255 260 265 270 

CTG GTT GAA ACA GCT CAG GAG AGA AAT GAA ACG CTT TTT CCA GCT CTC 1106 
Leu Val Glu Thr Ala Gin Glu Arg Asn Glu Thr Leu Phe Pro Ala Leu 
275 280 285 

ATT TAC TCC TCA ACA ATG GTG TGG TTG GTG AAT ATG GCA GAA GGA GAC 1154 
He Tyr Ser Ser Thr Met Val Trp Leu Val Asn Met Ala Glu Gly Asp 
290 295 300 

CCG GAA GCT CAA AGG AGA GTA TCC AAA AAT TCC AAG TAT AAT GCA GAA 1202 
Pro Glu Ala Gin Arg Arg Val Ser Lys Asn Ser Lys Tyr Asn Ala Glu 
305 310 315 

AGC ACA GAA AGG GAG TCA CAA GAC ACT GTT GCA GAG AAT GAT GAT GGC 1250 
Ser Thr Glu Arg Glu Ser Gin Asp Thr Val Ala Glu Asn Asp Asp Gly 
320 325 330 

GGG TTC AGT GAG GAA TGG GAA GCC CAG AGG GAC AGT CAT CTA GOG CCT 1298 
Gly Phe Ser Glu Glu Trp Glu Ala Gin Arg Asp Ser His Leu Gly Pro 
335 340 345 350 

CAT CGC TCT ACA CCT GAG TCA CGA GCT GCT GTC CAG GAA CTT TCC AGC 1346 
His Arg Ser Thr Pro Glu Ser Arg Ala Ala Val Gin Glu Leu Ser Ser 
355 360 365 

AGT ATC CTC GCT GGT GAA GAC CCA GAG GAA AGG GGA GTA AAA CTT GGA 1394 
Ser He Leu Ala Gly Glu Asp Pro Glu Glu Arg Gly Val Lys Leu Gly 
370 375 380 
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TTG GGA GAT TTC ATT TTC TAC AGT GTT CTG GTT GGT AAA GCC TCA GCA 1442 
Leu Gly Asp Phe lie Phe Tyr Ser Val Leu Val Gly Lys Ala Scr Ala 
385 390 395 

ACA GCC AGT GGA GAG TGG AAC ACA ACC ATA GCC TGT TTC GTA GCC ATA 1490 
Thr Ala Ser Gly Asp Trp Asn Thr Thr He Ala Cys Phe Val Ala He 
400 405 410 

TTA ATT GGT TTG TGC CTT ACA TTA TTA CTC CTT GCC ATT TTC AAG AAA 1538 
Leu He Gly Leu Cys Leu Thr Leu Leu Leu Leu Ala He Phe Lye Lys 
415 420 425 430 

GCA TTG CCA GCT CTT CCA ATC TCC ATC ACC TTT GGG CTT GTT TTC TAC ISBfi 
Ala Leu Pro Ala Leu Pro He Ser He Thr Phe Gly Leu Val Phe Tyr 
435 440 445 

TTT GCC ACA GAT TAT CTT GTA CAG CCT TTT ATG GAC CAA TTA GCA TTC 1634 
Phe Ala Thr Asp Tyr Leu Val Gin Pro Phe Mec Asp Gin Leu Ala Phe 
450 455 460 

CAT CAA TTT TAT ATC TAGCATATTT GCGGTTAGAA TCCCATGGAT GTTTCTTCTT 1689 
His Gin Phe Tyr He 
465 

TGACTATAAC CAAATCTGGG GAGGACAAAG GTGATTTTCC TGTGTCCACA TCTAACAAAG 1749 

TCAAGATTCC CGGCTGGACT TTTGCAGCTT CCTTCCAAGT CTTCCTGACC ACCTTGCACT 1809 

ATT6GACTTT GGAAGGAOGT GCCTATAGAA AACGATTTTG AACATACTTC ATCGCAGTGG 1869 

ACTGTGTCCC TCGGTGCAGA AACTACCAGA TTTGAGGGAC GA6GTCAAGG AGATATGATA 1929 

GGCCCGGAAG TTGCTGTGCC CCATCAGCAG CTTGACGCGT GGTCACA6GA CGATTTCACT 1989 

GACACTGCGA ACTCTCAGGA CTACCGGTTA CCAAGAGGTT AGGTGAAGTG GTTTAAACCA 2049 

AACGGAACTC TTCATCTTAA ACTACAOGTT GAAAATCAAC CCAATAATTC TGTATTAACT 2109 

GAATTCTGAA CTTTTCAGGA GGTACTGTGA GGAAGAGCAG GCACCACCAG CAGAATGGGG 2169 

AATGGAGAGG TGGGCAGGGG TTCCAGCTTC CCTTTGATTT TTTGCTGCAG ACTCATCCTT 2229 

TTTAAATGAG ACTTGTTTTC CCCTCTCTTT GAGTCAAGTC AAATATGTAG ATTGCCTTTG 2289 

GCAATTCTTC TTCTCAAGCA CT6ACACTCA TTACCGTCTG TGATTGCCAT TTCTTCCCAA 2349 

GGCCAGTCTG AACCTGAGGT TGCTTTATCC TAAAAGTTTT AACCTCAGGT TCCAAATTCA 2409 

GTAAATTTTG GAAACAGTAC AGCTATTTCT CATCAATTCT CTATCATGTT GAAGTCAAAT 2469 

TTGGATTTtC CACCAAATTC TGAATTTGTA GACATACTTG TACGCTCACT TGCCCCCAGA 2529 

TGCCTCCTCT GTCCTCATTC TTCTCTCCCA CACAAGCAGT CTTTTTCTAC AGCCAGTAAG 2589 

GCAGCTCTGT CRTGGTAGCA GATGGTCCCA TTATTCTAGG GTCTTACTCT TTGTATGATG 2649 

AAAAGAATGT GTTATGAATC GGTCCTGTCA GCCCTGCTGT CAGACCTTCT TCCACAGCAA 2709 

ATGAGATGTA TGCCCAAA6C GGTAGAATTA AAGAAGAGTA AAATGGCTGT TGAAGC 2765 

(2) INFORMATION FOR SEQ ID N0:2: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 467 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(il) NOLSCULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
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Mec Thr GIu Leu Pro Ala Pro Leu Ser Tyr Phe Gin Asn Ala Gin Met 
15 10 15 

Ser Glu Asp Asn His Leu Ser Aen Thr Val Arg Ser Gin Asn Asp Asn 
20 25 30 

Arg Glu Arg Gin Glu His Asn Asp Arg Arg Ser Leu Gly His Pro Glu 
35 40 45 

Pro Leu Ser Asn Gly Arg Pro Gin Gly Asn Ser Arg Gin Val Val Glu 
50 55 GO 

Gin Asp Glu Glu Glu Asp Glu Glu Leu Thr Leu Lys Tyr Gly Ala Lys 
65 70 75 80 

His Val lie Met Leu Phe Val Pro Val Thr Leu Cys Met Val Val Val 
85 90 95 

Val Ala Thr lie Lys Ser Val Ser Phe Tyr Thr Arg Lys Asp Gly Gin 
100 105 110 

Leu Zle Tyr Thr Pro Phe Thr Glu Asp Thr Glu Thr Val Gly Gin Arg 

115 120 125 

Ala Leu His Ser He Leu Asn Ala Ala He Met He Ser Val He Val 
130 135 140 

Val Met Thr He Leu Leu Val Val Leu Tyr Lys Tyr Arg Cys Tyx Lvs 
145 150 155 ^ ' / 

Val Zle His Ala Trp Leu He He Ser Ser Leu Leu Leu Leu Phe Phe 

165 170 175 

Phe Ser Phe He Tyr Leu Gly Glu Val Phe Lys Thr Tyr Asn Val Ala 
180 185 190 

Val Asp Tyr He Thr Val Ala Leu Leu He Trp Asn Phe Gly Val Val 
195 200 205 

Gly Met He Ser He His Trp Lys Gly Pro Leu Arg Leu Gin Gin Ala 
210 215 220 

Tyr Leu He Met He Ser Ala Leu Met Ala Leu Val Phe He Lys Tyr 
225 230 235 240 

Leu Pro Glu Trp Thr Ala Trp Lea He Leu Ala Val He Ser Val Tyr 
245 250 255 

Asp Leu Val Ala Val Leu Cys Pro Lys Gly Pro Leu Arg Met Leu Val 
260 265 270 

Glu Thr Ala Gin Glu Arg Asn Glu Thr Leu Phe Pro Ala Leu He Tyr 
275 280 285 

Ser Ser Thr Met Val Trp Leu Val Asn Met Ala Glu Gly Asp Pro Glu 
290 295 300 

Ala Gin Arg Arg Val Ser Lys Asn Ser Lys Tyr Asn Ala Glu. Ser Thr 
305 310 315 320 

Glu Arg Glu Ser Gin Asp Thr Val Ala Glu Asn Asp Asp Gly Gly Phe 

325 330 335 

Ser Glu Glu Trp Glu Ala Gin Arg Asp Ser His Leu Gly Pro His Arg 
340 345 350 

Ser Thr Pro Glu Ser Arg Ala Ala Val Gin Glu Leu Ser Ser Ser He 
355 360 365 

Leu Ala Gly Glu Asp Pro Glu Glu Arg Gly Val Lys Leu Gly Leu Gly 
370 375 380 
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Aap Phe lie Phe Tyr Ser Val Leu Val Gly Lys Ala Ser Ala Thr Ala 
385 390 395 400 

Ser Gly Asp Trp Asn Thr Thr lie Ala Cys Phe Val Ala He Leu He 
405 410 415 

Gly Leu Cys Leu Thr Leu Leu Leu Leu Ala He Phe Lya Lys Ala Leu 
420 425 430 

Pro Ala Leu Pro He Ser He Thr Phe Gly Leu Val Phe Tyr Phe Ala 
435 440 445 

Thr Asp Tyr Leu Val Gin Pro Phe Met Asp Gin Leu Ala Phe His Gin 
450 455 460 

Phe Tyr He 
465 

<2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3086 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: COS 

(B) LOCATION: 557.. 1945 

(ix) FEATURE: 

(A) NAME/KEY: miac^f eature 

(B) LOCATION: I..30B6 

(D) OTHER INFORMATION: /note- **hPSl-2*' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



GAATTCGGCA 


CGAGGGAAAT 


GCTGTTTGCT 


OGAAGACGTC 


TCAGGGCGCA 


GGTGCCTTGG 


60 


GCCGGGATTA 


GTAGCCGTCT 


GAACTGGAGT 


GGAGTAGGAG 


AAAGAGGAAG 


CGTCTTGGGC 


120 


TGGGTCTGCT 


TGAGCAACTG 


GTGAAACTCC 


GCGCCTCACG 


CCCCGGGTGT 


GTCCTTGTCC 


180 


AGGGGC6ACG 


AGCATTCTGG 


GCGAAGTCCG 


CACSCCTCTT 


GTTC6AGGCG 


GAAGAC6GG6 


240 


TCTGATSCTT 


TCTCCTTGGT 


CGGGMCTGTC 


TCGAGGCATG 


CATGTCCAGT 


GACTCTTGTG 


100 


TTTGCTGCTG 


CTTCCCTCTC 


AGATTCTTCT 


CACCGTTGTG 


6TCAGCTCTG 


CTTTA66CAT 


360 


ATTAAtCCAt 


AGTGGAGGCT 


GGGATGGGTG 


AGAGAATT6A 


GGTGACTTTT 


CCATAATTCA 


420 


6ACCTAATCT 


6GGAGCCTGC 


AAGTGACAAC 


AGCCTTTGCO 


GTCCTTAGAC 


AGCTTG6CCT 


480 


GGAGGAGAAC 


ACATGAAAGA 


AAGAACCTCA 


AGAG6CTTTG 


TTTTCTGTGA 


AACAGTATTT 


540 


CTATACAGTT 


GCTCCA ATG 
Net 


ACA GAG TTA CCT GCA CCG TTG TCC TAC TTC 
Thr Glu Leu Pro Ala Pro Leu Ser Tyr Phe 


589 



15 10 

CAG AAT GCA CAG ATG TCT GAG GAC AAC CAC CTG AGC AAT ACT AAT GAC 637 
Gin Asn Ala Gin Met Ser Glu Asp Asn His Leu Ser Asn Thr Asn Asp 
15 20 25 

AAT AGA GAA CGG CAG GAG CAC AAC GAC AGA CGG AGC CTT GGC CAC CCT 685 
Asn Arg Glu Arg Gin Glu His Asn Asp Arg Arg Ser Leu Gly His Pro 
30 35 40 

GAG CCA TTA TCT AAT GGA CGA CCC CAG GGT AAC TCC CGG CAG GTG GTG 733 
Glu Pro Leu Ser Asn Gly Arg Pro Gin Gly Asn Ser Arg Gin Val Val 
45 50 55 
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GAG CAA GAT GAG GAA GAA GAT GAG GAG CTG ACA TTG AAA TAT GOC GCC 7B1 
Glu Gin Asp Glu Glu Glu Asp Glu Glu Leu Thr Leu Lys Tyr Gly Ala 
€0 65 70 75 

AAG CAT GTG ATC ATG CTC TTT GTC CCT GTG ACT CTC TGC ATG GTG GTG 829 
Lys His Val He Met Leu Phe Val Pro Val Thr Leu Cys Met Val Val 
80 85 90 

GTC GTG GCT ACC ATT AAG TCA GTC AGC TTT TAT ACC CGG AAG GAT GGG 877 
Val Val Ala Thr He Lys Ser Val Ser Phe Tyr Thr Arg Lys Asp Gly 
95 100 105 

CAG CTA ATC TAT ACC CCA TTC ACA GAA GAT ACC GAG ACT GTG GGC CAG 925 
Gin Leu He Tyr Thr Pro Phe Thr Glu Asp Thr Glu Thr Val Gly Gin 
110 lis 120 

AGA GCC CTG CAC TCA ATT CTG AAT GCT GCC ATC ATG ATC AGT GTC ATT 973 
Arg Ala Leu His Ser He Leu Asn Ala Ala He Met He Ser Val He 
125 130 135 

GTT GTC ATG ACT ATC CTC CTG GTO GTT CTG TAT AAA TAC AGG TGC TAT 1021 
Val Val Met Thr He Leu Leu Val Val Leu Tyr Lys Tyr Arg Cys Tyr 
140 145 150 155 

AAG GTC ATC CAT GCC TGG CTT ATT ATA TCA TCT CTA TTG TTG CTG TTC 1069 
Lys Val He His Ala Trp Leu He He Ser Ser Leu Leu Leu Leu Phe 
160 165 170 

TTT TTT TCA TTC ATT TAC TTG GGG GAA GTG TTT AAA ACC TAT AAC GTT 1117 
Phe Phe Ser Phe He Tyr Leu Gly Glu Val Phe Lys Thr Tyr Asn Val 
175 180 185 

GCT GTG GAC TAC ATT ACT GTT GCA CTC CTG ATC TGG AAT TTG GGT GTG 1165 
Ala Val Asp Tyr He Thr Val Ala Leu Leu He Trp Asn Leu Gly Val 
190 195 200 

GTG GGA ATG ATT TCC ATT CAC TGG AAA GGT CCA CTT CGA CTC CAG CAG 1213 
Val Gly Met He Ser He His Trp Lys Gly Pro Leu Arg Leu Gin Gin 
205 210 215 

GCA TAT CTC ATT ATG ATT AGT GCC CTC ATG GCC CTG GTO TTT ATC AAG 1261 
Ala Tyr Leu He Met He Ser Ala Leu Met Ala Leu Val Phe He Lys 
220 225 230 235 

TAC CTC CCT GAA TGG ACT GCG TGG CTC ATC TTG GCT GTG ATT TCA GTA 1309 
Tyr Leu Pro Glu Trp Thr Ala Trp Leu He Leu Ala Val He Ser Val 
240 245 250 

TAT GAT TTA GTG GCT GTT TTG TOT CCG AAA GGT CCA CTT CGT ATG CTG 1357 
Tyr Asp Leu Val Ala Val Leu Cys Pro Lys Gly Pro Leu Arg Met Leu 
255 260 265 

GTT GAA ACA GCT CAG GAG AGA AAT GAA AGO CTT TTT CCA GCT CTC ATT 1405 
Val Glu Thr Ala Gin Glu Arg Asn Glu Thr Leu Phe Pro Ala Leu He 
270 275 280 

TAC TCC TCA ACA ATG GTG TGG TTG GTG AAT ATG GCA GAA GGA GAC CCG 1453 
Tyr Ser Ser Thr Met Val Trp Leu Val Asn Met Ala Glu Gly Asp Pro 
285 290 295 

GAA GCT CAA AGG AGA GTA TCC AAA AAT TCC AAG TAT AAT GCA GAA AGC 1501 
Glu Ala Gin Arg Arg Val Ser Lys Asn Ser Lys Tyr Asn Ala Glu Ser 
300 305 310 315 

ACA GAA AGG GAG TCA CAA GAC ACT GTT GCA GAG AAT GAT GAT GGC GGG 1549 
Thr Glu Arg Glu Ser Gin Asp Thr Val Ala Glu Asn Asp Asp Gly Gly 
320 325 330 

TTC AGT GAG GAA TGG GAA GCC CAG AGG GAC AGT CAT CTA GGG CCT CAT 1597 
Phe Ser Glu Glu Trp Glu Ala Gin Arg Asp Ser His Leu Gly Pro His 
335 340 345 
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CCC TCT ACA CCT GAG TCA CGA OCT GCT GTC CAG GAA CTT TCC AGC AGT 1645 
Arg Ser Thr Pro Glu Ser Arg Ala Ala Val Gin Glu Leu Ser Ser Ser 
350 . 355 360 

ATC CTC GCT GGT GAA GAC CCA GAG GAA AGG GGA GTA AAA CTT G6A TTG 1693 
lie Leu Ala Gly Glu Asp Pro Glu Glu Arg Gly Val Lys Leu Gly Leu 
365 370 375 

GGA GAT TTC ATT TTC TAC AGT GTT CTG GTT GGT AAA GCC TCA GCA ACA 1741 
Gly Asp Phe He Phe Tyr Ser Val Leu Val Gly Lys Ala Ser Ala Thr 
380 385 390 395 

GCC AGT GGA GAC TGG AAC ACA ACC ATA GCC TGT TTC GTA GCC ATA TTA 1789 
Ala Ser Gly Asp Trp Asn Thr Thr He Ala Cys Phe Val Ala He Leu 
400 405 410 

ATT GGT TTG TGC CTT ACA TTA TTA CTC CTT GCC ATT TTC AAG AAA GCA 1837 
He Gly Leu Cys Leu Thr Leu Leu Leu Leu Ala He Phe Lys Lys Ala 
415 420 425 

TTG CCA GCT CTT CCA ATC TCC ATC ACC TTT GGG CTT GTT TTC TAC TTT 1885 
Leu Pro Ala Leu Pro He Ser He Thr Phe Gly Leu Val Phe Tyr Phe 
430 435 440 

GCC ACA GAT TAT CTT GTA CAG CCT TTT ATG GAC CAA TTA GCA TTC CAT 1933 
Ala Thr Asp Tyr Leu Val Gin Pro Phe Met Asp Gin Leu Ala Phe His 
445 450 455 

CAA TTT TAT ATC TAGCATATTT GCGGTTA6AA TCCCATGGAT GTTTCTTCTT 1985 

Gin Phe Tyr He 

460 



TGACTATAAC 


CAAATCTGGG 


GAGGACAAAG 


GTGATTTTCC 


TGTGTCCACA 


TCTAACAAAG 


2045 


TCAAGATTCC 


CGGCTGGACT 


TTTGCAGCTT 


CCTTCCAAGT 


CTTCCTGACC ACCTTGCACT 


2105 


ATTGGACTTT 


GGAAG6AGGT 


GCCTATAGAA 


AACGATTTTG 


AACATACTTC 


ATCGCAGTGG 


2165 


ACTGTGTCCT 


CGGTGCAGAA 


ACTACCAGAT 


TTGAGGGACG 


AGGTCAAGGA 


GATATGATAG 


2225 


GCCCGGAAGT 


TGCTGTGCCC 


CATCAGCAGC 


TTGACGCGTG 


GTCACAGGAC 


GATTTCACTG 


2285 


ACACTGCGAA 


CTCTCAGGAC 


TACCGGTTAC 


CAAGAGGTTA 


GGTGAAGTGG 


TTTAAACCAA 


2345 


AC6GAACTCT 


TCATCTTAAA 


CTACACGTTG 


AAAATCAACC 


CAATAATTCT 


GTATTAACT6 


2405 


AATTCTGAAC 


TTTTCAGGAG 


GTACTGTGAG 


GAAGAGCAGG 


CACCAGCAGC 


AGAATGGGGA 


2465 


ATGGAGAGGT 


GGGCAGGGGT 


TCCAGCTTCC 


CTTTGATTTT 


TTGCTGCAGA 


CTCATCCTTT 


2525 


TTAAATGAGA 


CTTGTTTTCC 


CCTCTCTTTG 


AGTCAAGTCA 


AATATGTAGA 


TGCCTTTGGC 


2585 


AATTCTTCTT 


CTCAAGCACT 


GACACTCATT 


ACCGTCTGTG 


ATTGCCATTT 


CTTCCCAAGG 


2645 


CCAGTCTGAA 


CCTGAGGTTG 


CTTTATCCTA 


AAAGTTTTAA 


CCTCAGGTTC 


CAAATTCAGT 


2705 


AAATTTTGGA 


AACAGTACAG 


CTATTTCTCA 


TCAATTCTCT 


ATCATGTTGA 


AGTCAAATTT 


2765 


GGATTTTCCA 


CCAAATTCTG 


AATTTGTAGA 


CATACTTGTA 


CGCTCACTTG 


CCCCAGATGC 


2825 


CTCCTCTGTC 


CTCATTCTTC 


TCTCCCACAC 


AAOCAGTCTT 


TTTCTACAGC 


CAGTAAGGCA 


2885 


GCTCTGTCGT 


GGTAGCAGAT 


GGTCCCACTT 


ATTCTAGGGT 


CTTACTCTTT 


GTATGATGAA 


2945 


AA6AATGTGT 


TATGAATCGG 


TGCTGTCAGC 


CCTGCTGTCA 


6ACCTTCTTC 


CACAGCAAAT 


3005 


GAGATGTATG 


CCCAAAGCGG 


TAGAATTAAA 


GAAGAGTAAA 


ATGGCTCTT6 


AAGCAAAAAA 


3065 


AAAAAAAAAA 


AAAAAAAAAA 


A 








3086 



(2) IMFORHHTION FOR SEQ ID N0:4: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Thr Glu Leu Pro Ala Pro Leu Ser Tyr Phe Gin Asn Ala Gin Met 
1 S 10 15 

Ser Glu Asp Asn His Leu Ser Asn Thr Asn Asp Asn Arg Glu Arg Gin 

20 25 30 

Glu His Asn Asp Arg Arg Ser Leu Gly His Pro Glu Pro Leu Ser Asn 
35 40 45 

Gly Arg Pro Gin Gly Asn Ser Arg Gin Val Val Glu Gin Asp Glu Glu 
50 55 60 

Glu Asp Glu Glu Leu Thr Leu Lys Tyr Gly Ala Lys His Val He Met 
65 70 75 80 

Leu Phe Val Pro Val Thr Leu Cya Met Val Val Val Val Ala Thr He 
85 90 95 

Lys Ser Val Ser Phe Tyr Thr Arg Lys Asp Gly Gin Leu He Tyr Thr 
100 105 110 

Pro Phe Thr Glu Asp Thr Glu Thr Val Gly Gin Arg Ala Leu His Ser 
115 120 125 

He Leu Asn Ala Ala He Met He Ser Val He Val Val Met Thr He 
130 135 140 

Leu lieu Val Val Leu Tyr Lys Tyr Arg Cys Tyr Lys Val He His Ala 
145 150 155 160 

Trp Leu He He Ser Ser Leu Leu Leu Leu Phe Phe Phe Ser Phe He 
165 170 175 

Tyr Leu Gly Glu Val Phe Lys Thr Tyr Asn Val Ala Val Asp Tyr He 
IBO 185 190 

Thr Val Ala Leu Leu He Trp Asn Leu Gly Val Val Gly Met He Ser 
195 200 205 

He His Trp Lys Gly Pro Leu Arg Leu Gin Gin Ala Tyr Leu He Met 
210 215 220 

He Ser Ala Leu Met Ala Leu Val Phe He Lya Tyr X<eu Pro Glu Trp 
225 230 235 240 

Thr Ala Trp Leu He Leu Ala Val He Ser Val Tyr Asp Leu Val Ala 
245 250 255 

Val Leu Cys Pro Lys Gly Pro Leu Arg Met Leu Val Glu. Thr Ala Gin 
260 265 270 

Glu Arg Asn Glu Thr Leu Phe -Pro Ala Leu He Tyr Ser Ser Thr Met 

275 280 285 

Val Trp Leu Val Asn Met Ala Glu Gly Asp Pro Glu Ala Gin Arg Arg 
290 295 300 

Val Ser Lys Asn Ser Lys Tyr Asn Ala Glu Ser Thr Glu Arg Glu Ser 
305 310 315 320 

Gin Asp Thr Val Ala Glu Asn Asp Asp Gly Gly Phe Ser Glu Glu Trp 
325 330 335 
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Glu Ala Gin Arg Asp Ser His Leu Gly Pro His Arg Ser Thr Pro Glu 
340 345 350 

Ser Arg Ala Ala Val Gin Glu Leu Ser Ser Ser lie Leu Ala Gly Glu 
355 360 365 

Asp Pro Glu Glu Arg Gly Val Lys Leu Gly Leu Gly Asp Fhe lie Phe 
370 375 3B0 

Tyr Ser Val Leu Val Gly Lys Ala Ser Ala Thr Ala Ser Gly Asp Trp 
385 390 395 400 

Asn Thr Thr He Ala Cys Fhe Val Ala He Leu lie Gly Leu Cys Leu 
405 410 415 

Thr Leu Leu Leu Leu Ala He Phe Lys Lys Ala Leu Pro Ala Leu Pro 
420 425 430 

He Ser He Thr Phe Gly Leu Val Phe Tyr Phe Ala Thr Asp Tyr Leu 
435 440 445 

Val Gin Pro Phe' Met Asp Gin Leu Ala Phe His Gin Phe Tyr He 
450 455 460 

1.2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2494 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1..2494 

(D) OTHER INFORMATION: /note- '•lExln2" 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



AAGCTTTTGT 


GTGTAAAAAG 


TATTAGAATC 


TCATGTTTTT 


GAACAAGGTT 


GGCAGTGGGT 


60 


TGG6AGGAGG 


GATTGGAGAT 


TGATGCGATA 


GGAATGTGAA 


GGGATAGCTT 


GGGGTGGATT 


120 




ATTTTAATTT 


TTATTTKTTG 


AGATGGAGTC 


TTGCTCTGTC 


TCCCAGGCTG 


IBO 


GAGTGCAGTG 


GTGTGATCTC 


AGCTCACGGG 


TTCAAGCGAT 


TCTCCTGCTG 


CAGCCTCCCG 


240 


AGTAGCTGGG 


ATTACAG6AG 


CGCGCCACCA 


CACCCGGNTA 


ATTTNNTTGT 


ATTTTTAGTA 


300 


GAGACGGGGT 


TTCACCATGT 


TGGGTTAGGC 


TGGTCTAGAA 


CTCCCAACCT 


CATGATCCGC 


360 


CTGCTTCGGC 


CTCCCAAAGT 


GCCGGAATTA 


CAGGCGTGAG 


CGACTGCACC 


CGGCCGCTTG 


420 


GGGGTGGATT 


TTTAAAGAAA 


CTTTAGAAGA 


ATGTAACTTG 


SCCAGATACC 


ATGTACCGTT 


480 


AATTTCATTT 


TCGGTTTTTK 


GAATACCCAT 


GTTTGACATT 


TMTCCGTTCA 


CCTTGATTAA 


540 


ATAAGGTAGT 


ATTCATTTTT 


TAGTTTTAGC 


TTTTGGATAT 


ATGTGTAAGT 


GTGGTAT6CT 


600 


GTCTAATGAA 


TTAAGACAAT 


TGGTNCTKTC 


TTTACCCMAM 


ANCTGGACMA 


AGAGCAGGCA 


660 


AGATGCAAAA 


ATCAAGT6AC 


CCAGCAAACC 


AGACACATTT 


TCTGCTCTCA 


GCTAGCTTGC 


720 


CACCTAGAAA 


6ACTGGTTGT 


CAAAGTTGGA 


GTCCAAGAAT 


CGCG6AGGAT 


GTTTAAAATG 


780 


CAGTTTCTCA 


GGTTCTCNCC 


ACCCACCAGA 


AGTTTTGATT 


CATTGAGTGG 


TGGGAGA6GG 


840 


CAGAGATATT 


TGCGATTTTA 


ACAGCATTCT 


CTTGATTGTG 


ATGCAGCTGG 


TTCSCAAATA 


900 


GGTACCCTAA 


AGAAATGACA 


GGTGTTAAAT 


TTAGGATGGC 


CATCGCTTGT 


ATGCCGGGAG 


960 
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AAGCAOICGC TGGGCCCAAT TTATATAGG6 GCTTTCGTCC TCAGCTCGAG CARCCTCAGA 1020 

ACCCCGACAA CCYACGCCAG CKCTCTGGGC GGATTCCRTC AGKTGGGGAA GSCCAGGTGG 1080 

AGCTCTGGKT TCTCCCCGCA ATCGTTTCTC CAGGCCGGAG GCCCCGCCCC CTTCCTCCTG 1140 

GCTCCTCCCC TCCTCCGTGG GCCGNCCGCC AACGACGCCA GAGCCGGAAA TGACGACAAC 1200 

GGTGAGGGTT CTCX;GGCGGG GCCTGGGACA GGCAGCTCCG GGGTCCGCGG TTTTCACATC 1260 

GGAAACAAAA CAGCGGCTGG TCTGGAAGGA ACCTGAGCTA CGACCCGCGG CGGCAGCGGG 1320 

GCGGCGGGGA AGCGTATGTG CGTGATGGGG AGTCCGGGCA AGCCAGGAAG GCACCGCGGA 1380 

CATGGGCGGC CGCGGGCAGG GNCC6GNCCT TTGTGGCCGC CCGGGCCGC6 AA6CCGGTGT 1440 

CCTAAAAGAT GAGGGGCGGG GCGCGGCCGG TTGGGGCTGG GGAACCCCGT GTGGGAAACC 1500 

AGGAGGG6C6 GCCCGTTTCT CGGGCTTCGG GCGCGGCCGG GTGGAGAGAG ATTCCGGG6A 1560 

GCCTTGGTCC GGAAATGCTG TTTGCTCGAA GACGTCTCAG GGCGCAGGTG CCTTGGGCCG 1620 

GGATTAGTAG CCGTCTGAAC TGGAGTGGAG TAGGAGAAAG AGGAAGCGTC TTGGGCTGGG 1680 

TCTGCTTGAG CAACTGGTGA AACTCCGCGC CTCACGCCCC GGGTGTGTCC TTGTCCAGGG 1740 

GCGACGAGCA TTCTGGGCGA AGTCCGCACG CCTCTTGTTC GAGGCGGAAG AC6GGGTCTT 1800 

GATGCTTTCT CCTTGGTCGG GACTGTCTCG AGGCATGCAT GTCCAGTGAC TCTTGTGTTT 1860 

GCTGCTGCTT CCCTCTCAGA TTCTTCTCAC CGTTGT6GTC AGCTCTGCTT TAGGCATATT 1920 

AATCCATAGT GGAGGCTGGG ATGGGTGAGA GAATTGAGGT GACTTTTCCA TAATTCAGGT 1980 

GAGATGTGAT TAGAGTYCGG ATCCTNCGGT G6TGGCAGAG GCTTACCAAG AAACACTAAC 2040 

GGGACATGGG AACCAATTGA GGATCCAGGG AATAAAGTGT GAAGTTGACT AGGAGGTTTT 2100 

CAGTTTAAGA ACATGGCAGA GACATTCTCA GAAATAAGGA AGTTAGGAAG AAAGACCTGG 2160 

TTTAGAGAGG AGGGCGAGGA AGTGGTTTGG AAGTGTCACT T7GGAAGTGC CAGCAGGTGA 2220 

AAATGCCCTG TGAACAGGAC TGGAGCTGAA AACAGGAATC AATTCCATAG ATTTCCAGTT 2280 

GATGTTGGA6 CAGTGGAGAA GTCTAANCTA AGGAAGGGGA AGAGGAGGCC AAGCCAAACA 2340 

CTTAGGAACA CTTNCNACGA GGGGGTGGAA GAAGAGCAAG GAGCCA6CTG AGGACSAATGA 2400 

GTGTGGTTGG A6AACCACCA CAGCNCAGGG TCGCCAGANC TGAGGAAGGG 6AGGGAAGCT 2460 

TATCGAGKAM SGHCRACHRC GA6TTG0CAG GGAT 2494 

(2) INFORMATION FOR SBQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: miac_f eature 

(B) LOCATION: 1..1117 

(D) OTHER INFORMATION: /notes •*lEx3n4'* 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

GGATCCGCCC GCCTTGGCCT CCCAAAGTGC TGGGATTACA GGCATGAGCC ACCGCTCCTG 60 

GCTGAOTCTG CGATTTCTTG CCAGCTCTAC CCAGTTGTGT CATCTTAAGC AAGTCACTGA 120 

ACTTCTCTGG ATTCCCTTCT CCTNNWGTAA AATAAGNATG TTATCTGNCC NNCCTGCCTT 180 
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GGGCA7TGTG 


ATAAGGATAA 


GAT6ACATTA 


TAGAATNTK6 CAAAATTAAA AGC6CTAGAC 


240 


AAATGATTTT 


ATGAAAATAT 


AAA6ATTAGN 


TTGAGTTTGG GCCAGCATAG 


AAAAAGGAAT 


300 


GTTGAGAACA 


TTCOITTAAG 


GATTACTCAA GCYCCCCTTT TGSTGKNWAA TCAGANNGTC 


360 


ATNNAMNTAT 


CNTNTGTGGG 


YTGAAAATGT 


TTGGTTGTCT CAGGCGGTTC 


CTACTTATTG 


420 


CTAAAGAGTC 


CTACCTTGAG 


CTTATAGTAA 


ATTTGTCAGT TAGTTGAAAG 


TCGTGACAAA 


480 


TTAATACATT 


CCTGGTTTAC 


AAATTGGTCT 


TATAAGTATT TGATTGGTNT 


AAATGNATTT 


540 


ACTAGGATTT 


AACTAACAAT 


6GATGACCTG 


GTGAAATCCT ATTTCAGACC 


TAATCTGGGA 


600 


GCCTGCAAGT 


GACAACAGCC 


TTTGC6GTCC 


TTAGACAGCT TGGCCTGGAO 


6AGAACACAT 


660 


GAAAGAAAGG 


TTTGWIITCTG 


KTTAWTGTAA TCTATGRAAG TCTTTTTWAT MACAGTATAA 


720 


TTGTMTGMAC 


AAAGTTCTGT 


TTrrCTTTCC 


CTTTNCAGAA CCTCAAGAGG 


CTTTGTTTTC 


780 


TOTGAAACAG 


TATTTCTATA 


CAGTTGCTCC 


AATGACAGAG TTACCTGCAC 


CGTTGTCCTA 


840 


CTTCCAGAAT 


GCACAGATGT 


CTGAGGACAA 


CCACCTGAGC AATACTGTAC 


GTA6CCAGGT 


900 


ACAGCG7CAG 


TYTCTNAAAC 


TGCCTYYGNC 


AGACTGGATT CACTTATCAT 


CTCCCCTCAC 


960 


CTCTGAGAAA 


TGCTGAGGGG 


GSTAGGNAGG 


GCTTTCTCTA CTTNACCACA 


TTTNATAATT 


1020 


ATTTTTGGGT 


GACCTTCAGC 


TGATCGCTGG 6AGGGACACA GGGCT'INTTT 


AACACATAGG 


1080 


GTGTTGGATA 


CAGNCCCTCC 


CTAATTCACA 


TTTCANC 




1117 



(2) INFORMATION FOR SEQ ID NO: 7: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1727 base pairs 

(B) TYPB: nucleic acid 
(C} STRANDEDNE55: single 
(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1..1727 

(D) OTHER INFORMATION: /note» "lExS* 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 



GGATCCCTCC 


CCTTTTTAGA 


CCATACAAGG 


TAACTTCCGG 


ACGTTGCCAT 


GGCATCTGTA 


60 


AACTGTCATG 


GT6TTGGCGG 


GGAGTGTCTT 


TTAGCATGCT 


AAT6TATTAT AATTAGCGTA 


120 


TAGT6AGCAG 


TGAGGATAAC 


CAGAGGTCAC 


TCTCCTCACC 


ATCTTGGTTT 


TGGTGGGTTT 


180 


TGGCCAGCTT 


CTTTATTGCA 


ACCAGTTTTA 


TCAGCAAGAT 


CTTTATGAGC 


TGTATCTTGT 


240 


GCTGACTTCC 


TATCTCATCC 


CGNAACTAAG 


AGTACCTAAC 


CTCCTGCAAA 


TTGMAGNCCA 


300 


GNAGGTCTTG 


GNCTTATTTN 


ACCCAGCCCC 


TATTCAARAT 


AGAGTNGYTC 


TTGGNCCAAA 


360 


CGCCYCTGAC 


ACAAGGATTT 


TAAAGTCTTA 


TTAATTAAGG 


TAAGATAGKT 


CCTTGSATAT 


420 


GTGGTCTGAA 


ATCACA6AAA 


GCTGAATTTG 


GAAAAAGGTG 


CTTGGASCTG 


CAGCCAGTAA 


480 


ACAAGTTTTC 


AT6CAGGTGT 


CAGTATTTAA 


GGTACATCTC 


AAAGGATAAG 


TACAATTGTG 


540 


TATGTTGGGA 


TGAACAGAGA 


GAATGGAGCA 


ANCCAAGACC 


CAGOTAAAAG AGAGGACCTG 


600 


AATGCCTTCA 


GTGAACAATO 


ATAGATAATC 


TAOACTTTTA 


AACTGCATAC 


TTCCTGTACA 


660 


TTOTTTTTTC 


TTGCTTCAGG 


TTTTTAGAAC 


TCATAGTGAC 


GGGTCTGTTG 


TTAATCCCAG 


720 


GTCTAACCGT 


TACCTTGATT 


CTGCTGAGAA 


TCTGATTTAC 


TGAAAATGTT 


TTTCTTGTGC 


780 
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TTATAGAATG 


ACAATA6AGA 


ACGGCAGGAG 


CACAACGACA 


GACGGAGCCT 


TGGCCACCCT 


840 


6AGCCATTAT 


CTAATGGACG 


ACCCCAG66T 


AACTCCCGGC 


AGGTGGTGGA 


GCAAGATGAG 


900 


GAAGAAGATG 


AGGAGCTGAC 


ATTGAAATAT 


GGCGCCAAGC 


ATGTGATCAT 


GCTCTTTGTC 


960 


CCTGTGACTC 


TCTGCATGGT 


GGTGGTCGTG 


GCTACCATTA 


AGTCAGTCAG 


CTTTTATACC 


1020 


CGGAAGGATG 


GGCAGCTGTA 


CGTATGAGTT 


TKGTTTTATT 


ATTCTCAAAS 


CCAGTGTGGC 


1060 


TTTTCTTTAC 


AGCATGTCAT 


CATCACCTTG 


AAGGCCTCTN 


CATTGAAGGG 


GCATGACTTA 


1140 


6CTGGAGAGC 


CCATCCTCTG 


T6AT6GTCAG 


GAGCAGTTGA 


GAGAMCGAGG 


GGTTATTACT 


1200 


TCATGTTTTA 


AGTGGAGAAA 


AGGAACACTG 


CAGAAOTATG 


TTTCCTGTAT 


GGTATTACTG 


1260 


GATAGGGCTG 


AAGTTATGCT 


GAATTGAACA 


CATAAATTCT 


TTTCCACCTC AGGGNCATTG 


1320 


GGCXSCCCATT 


GNTCTTCTGC 


CTAGAATATT 


CTTTCCTTTN 


CTNACTTKGG 


NGGATTAAAT 


1380 


TCCTGTCATC 




TGGTOTTATA 


TATAAAOTNT 


TGGTGCCGCA 


AAAGAA6TAG 


1440 


CACTCGAATA 


TAAAATTTTC 


CTTTTAATTC 


TCAGCAAGGN 


AAGTTACTTC 


TATATAGAAG 


1500 


GGTGCACCCN 


TACAGATGGA 


ACAATGGCAA 


GCGCACATTT 


GGGACAAGGG 


AGGGGAAAGG 


1560 


GTTCTTATCC 


CTGACACACX3 


TGGTCCC3JGC 


TGNTGTGTNC 


TNCCCCCACT 


GANTAGGGTT 


1620 


AGACTGGACA 


GGCTTAAACT 


AATTCCAATT 


GGNTAATTTA 


AAGAGAATNA 


TGGGGTGAAT 


1680 


GCTTTGGGAG 


GA6TCAAGGA 


AGAGNAGGTA 


GNAGGTAACT 


TGAATGA 




1727 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1883 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEONESS : single 

(D) TOPOLOGY: linear 

( ix) FEATURE : 

(A) NAME/KEY: inisc_f eature 

(B) LOCATION: 1..1863 

(D) OTHER INFORMATION: /note* "lEx6" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

CNCGTATAAA AGACCAACAT TGCCANCNAC AACCACAGGC AAGATCTTCT CCTACCTTCC 60 

CCCNNGGTGT AATACCAAGT ATTCNCCAAT TTGTGATAAA CTTTCATTGG AAAOTGACCA 120 

CCCTCCTTGG TTAATACATT GTCTGTGCCT GCTTTCACAC TACAGTAGCA CAGTTGAGTG 180 

TTTGCCCTGG AOACCATATG ACCCATAGAG CTTAAAATAT TCAGTCTGGC TTTTTACAGA 240 

GATGTTTCTG ACTTTGTTAA TAGAAAATCA ACCCAACTGG TTTAAATAAT GCACATACTT 300 

TCTCTCTCAT AGAGTAGTGC AGAGGTAGNC AGTCCAGATT AGTA5GGTGG CTTCACGTTC 360 

ATCCAAGGAC TCAATCTCCT TCTTTCTTCT TTAGCTTCTA ACCTCTAGCT TACTTCAGGG 420 

TCCAGGCTGG AGCCCTASCC TTCATTTCTG ACAGTAGGAA GGAGTAGGGG AGAAAAGAAC 480 

ATAGGACATG TCAGCAGAAT TCTCTCCTTA GAAGTTCCAT ACACAACACA TCTCCCXAGA 540 

AGTCATTGCC CTTACTTGTT CTCATA6CCA TCCTAAATAT AAGGQAGTCA GAAGTAAAGT 600 

CTKKNTOGCT GGGAATATTG GCACCTGGAA TAAAAATGTT TTTCTGTGAA TGAGAAACAA 660 

GGGGAAGATG GATATGTGAC ATTATCTTAA GACAACTCCA GTTGCAATTA CTCTGCAGAT 720 

GAGAGGCACT AATTATAAGC CATATTACCT TTCTTCTGAC AACCACTTGT CAGCCCNCGT 780 
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GGTTTCTGTG 


GCAQAATCTG 


GTTCYATAMC 


AAGTTCCTAA 


TAANCTGTA5 CCNAAAAAAT 


840 


TTGATGAGGT 


ATTATAATTA 


TTTCAATATA 


AAGCACCCAC 


TAGATGGAGC CA6TGTCTGC 


900 


TTCACATGTT 


AAGTCCTTCT 


TTCCATATGT 


TAGACATTTT 


CTTTGAAGCA ATTTTAGAGT 


960 


GTAGCTGTTT 


TTCTCAGGTT 


AAAAATTCTT 


AGCTAGGATT 


GGTGAGTTGG GGAAAAGTGA 


1020 


CTTATAAGAT 


NCGAATTGAA 


TTAAGAAAAA 


GAAAATTCTG 


TGTTGGAGGT GGTAATGTGG 


1080 


KTGGTGATCT 


YCATTAACAC 


TGANCTAGGG 


CTTTKGKGTT 


TGKTTTATTG TAGAATCTAT 


1140 


ACCCCATTCA 


CAGAAGATAC 


CGAOACTGTG 


GGCCAGAGAG 


CCCTGCACTC AATTCTGAAT 


1200 


GCTGCCATCA 


TGATCAGT6T 


CATTGTTGTC 


ATGACTATCC 


TCCTGGTGGT TCTGTATAAA 


1260 


TACAGGTGCT 


ATAAG6TGAG 


CATGAGACAC 


AGATCTTTGN 


TTTCCACCCT GTTCTTCTTA 


1320 


TGGTTGGGTA 


TTCTTGTCAC 


AGTAACTTAA 


CTGATCTAGG 


AAAGAAAAAA TGTTTTGTCT 


1380 


TCTAGAGATA 


AGTTAATTTT 


TAGTTTTCTT 


CCTCCTCACT 


GTGGAACATT CAAAAAATAC 


1440 


AAAAAGGAAG 


CCAGGTGCAT 


GTGTAATGCC 


AGGCTCAGAG 


GCTGAGGCAG GAGGATCGCT 


1500 


7GGGCCCAGG 


AGTTCACAAG 


CAGCTTGGGC 


AACGTAGCAA 


GACCCTGCCT CTATTAAAGA 


1560 


AAACAAAAAA 


CAAATATTGG 


AAGTATTTTA 


TATGCATGGA 


ATCTATATGT CATGAAAAAA 


1620 


TTAGTGTAAA 


ATATATATAT 


TAT6ATTA6N 


TATCAAGATT 


TAGTGATAAT TTATGTTATT 


1680 


TTGGGATTTC 


AATGCCTTTT 


TAGGCCATTG 


TCTCAAMAAA 


TAAAAGCAGA AAACAAAAAA 


1740 


AGTTGTAACT 


GAAAAATAAA 


CATTTCCATA 


TAATAGCACA 


ATCTAAGTGG GTTTTTGNTT 


1800 


GTTTGTTTGK 


TTGTTGAAGC 


AGGGCCTTGC 


CCTNYCACCC 


AGGMTG6AGT GAAGTGCAGT 


1860 


G6CACGATTT 


TGGCTCACTG 


CAG 






1883 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1, .923 

(D) OTHER INFORMATION: /note- "lEx7» 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CAGGAGTGGA CTAGGTAAAT GNAAGNTGTT TTAAAGAGAG ATGNGGNCNG GGACATAGTG 60 

GTACACANCT GTAATGCTCA NCACTKATGG GGAGTACTGA AGGNGGNSGG ATCACTTGNG 120 

GGTCNGGAAT NTGAGANCAG CCTGGGCAAN AT6GC6AAAC CCTGTCTCTA CTAAAAATAG 180 

CCANAAWHNA GCCTAGC6TG 6TGGC6CRCA CGCGTGGTTC CACCTACTCA GGAGGCNTAA 240 

GCACGAGNAN TNCTTGAACC CAGGAGGCAG AGGNTGTGGT GARCTGAGAT CGTGCCACTG 300 

CACTCCAGTC TGGGCGACHA AGTGA6ACCC TGTCTCCNNN AAGAAAAAAA AAATCTGTAC 360 

TTTTTAAGGG TTGTGGGACC TGTTAATTAT ATTGAAATGC TTCTYTTCTA GGTCATCCAT 420 

GCCTGGCTTA TTATATCATC TCTATTGTTG CTGTTCTTTT TTTCATTCAT TTACTTGGGG 4 80 

TAAGTTGTGA AATTTGGGGT CTOTCTTTCA GAATTAACTA CCTNNGTGCT GTGTAGCTAT 540 
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CATTTAAAGC CATGTACTTT GNTGATGAAT TACTCTGAAG TTTTAATrGT NTCCACATAT £00 

AGGTCATACT TCGTATATAA AAGACTAGNC AGTATTACTA ATTGAOACAT TCTTCTGTNG 660 

CTCCTNGCTT ATAATAAGTA GAACTGAAAG NAACTTAAGA CTACAGTTAA TTCTAAGCCT 720 

TTGGGGAAGG ATTATATAGC CTTCTAGTAG GAAGTCTTGT GCNATCAGAA TGTTTNTAAA 780 

GAAAGGGTNT CAAGG^TKG TATAAAHACC AAAAATAATT GAT 823 
(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 945 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: raise feature 

(B) LOCATION: 1..945 

(D) OTHER INFORMATION: /note* "lExS" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GTTNTCCNAA CCAACTTAGG AGNTTGGACC TGGGRAAGAC CNACNTGATC TCCGGGA6GN 60 

AAAGACTNCA GTTGAGCCGT GATTGCACCC ACTTTACTCC AAGCCTGGGC AACCAAAATG 120 

AGACACTGGC TCCAAACACA AAAACAAAAA CAAAAAAAGA GTAAATTAAT TTANAGGGAA 180 

GNATTAAATA AATAATAGCA CAGTTGATAT AGGTTATGGT AAAATTATAA AGGTGGGANA 240 

TTAATATCTA ATGTTTGGGA GCCATCACAT TATTCTAAAT AATGTTTTGG TGGAAATTAT 300 

TOTACATCTT TTAAAATCTG TGTAATTTTT TTTCAGGGAA GTGTTTAAAA CCTATAACGT 360 

TGCTGTGGAC TACATTACTG TTGCACTCCT GATCTGGAAT TTTGGTGTGG TGGGAATGAT 420 

TTCCATTCAC TGGAAAGGTC CACTTCGACT CCAGCAGGCA TATCTCATTA TGATTACTGC 4 80 

CCTCATGGCC CTGOTOTTTA TCAAGTACCT CCCTGAATGG ACTGCGTGGC TCATCTTGGC 540 

TGTGATTTCA GTATATGGTA AAACCCAAGA CTGATAATTT GTTTGTCACA GGAATGCCCC '600 

ACTGGAGTGT TTTCTTTCCT CATCTCTTTA TCTTGATTTA GAGAAAATGG TAACGTGTAC 660 

ATCCCATAAC TCTTCA6TAA ATCATTAATT AGCTATAGTA ACTTTTTCAT TTGAAGATTT 720 

CGGCTGGGCA TGGTAGCTCA TGCCTGTAAT CTTAGCACTT TGGGAGGCTG AGGCGGGCAG 780 

ATCACCTAAG CCCAGAGTTC AAGACCAGCC TGGGCAACAT GGCAAAACCT CGTATCTACA 840 

GAAAATACAA AAATTAGCCG GGCATGGTGG TGCACACCTG TAGTTCCAOC TACTTAGGAG 900 

GCTGAGGTGG GAGGATCGAT TGATCCCAGG AGGTCAAGNC TGCAG 945 
(2> INFORMATION FOR SEQ ID NO:ll: 

(1) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CTGCAGCTTT CCTTTAAACT AGGAAGACTT GTTCCTATAC CCCAGTAACG ATACACTGTA 60 
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CACTAAGCAA 


ATAGCAGTCA AACCCAAATG 


AAATTTNTAC 


AGATGTTCTG TGTCATTTTA 


120 


TNTTGTTTAT 


GTTGTCTCCC CCACCCCCAC 


CAGTTCACCT 


GCCATTTATT TCATATTCAT 


180 


TCAACX3TCTN 


NNTGTGTAAA AAGAGACAAA 


AAACATTAAA 


CnTTTTCCT TCGTTAATTC 


240 


CTCCCtACCA 


CCCATTTACA AGTTTAGCCC 


ATACATTTTA 


TTAGATGTCT TTTATGTTTT 


300 


TCTTTTNCTA 


GATTTAGTGG CTGTTTTGTG 


TCCGAAAGGT 


CCACTTCGTA TGCTGGTTGA 


360 


AACAGCTCAG 


GAGAGAAAT6 AAACGCTTTT 


TCCAGCTCTC 


ATTTACTCCT GTAAGTATTT 


420 


GGAGAATGAT 


ATTGAATTAG TAATCAGNGT 


AGAATTTATC 


GGGAACTTGA AGANATGTNA 


480 


CTATGGCAAT 


TTCANGGNAC TTGTCTCATC TTAAATGANA GNATCCCTGG ACTCCTGMAG 


540 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 9 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

{A} NAME/KEY: misc feature 

(B) LOCATION: 1..509 

(D) OTHER INFORMATION: /note- "lExlO" 

(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CCCCGTCNAT GCATACTTTG TGTGTCCAGT GCTTACCTGG AATCCNGTCT TTCCCAACAG 60 

CAACAAT6GT GTGGTTGGTG AATATGGCAG AAGGAGACCC G6AAGCTCAA AGGAGA6TAT 120 

CCAAAAATTC CAA6TATAAT GCAGAAAGTA GGTAACTYYY NTTAGATAMN ATCTTGATTT 180 

TNCAGGGTCA CTGTTATAAG CTAACAGTAT AGNAATGTTT TTATCGTCTT TCTNXGGNCA 240 

TAGACTCCTN KGAGAATCTC TTGAGAACTA TGATAATGCC CAGTAAATAC NCAGATAAGT 300 

ATTTAAGGAG TNCAGATACT CAAANCCCAA CAATACNGTC AAAGCATCCT AGGTTAAGAC 360 

AMCNCCCATT AAATACAGAA TACCAGCATG GAAAGGTTCA GGCTGAGGTT ATGATTGGGT 420 

TTGGGTTTTG GGNNNGTTTT TTATAAGTCA TGATTTTAAA AAGAAAAAAT AAACTCTCTC 480 

CAAACATGTA AAAGTAA6AA TCTCCTAAA 509 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1092 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1..1092 

(D) OTHER INFORMATION: /note* "lExll" 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GTCTAGATAA GNCAACATTC A6GGGTAGAA GGGGACTGTT TATTTTTTCC TTTAGTCTCT 60 
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CTTAAAGAGT GAGAAAAATT TTCCCAGGAA TCCCGGTGGA CTTTGCTTCA CCACTCATAG 120 

GTTCATACCA AGTTACAACC CCACAACCTT AGAGCTTTTG TTAGGAAGAG GCTTGGTGGG 180 

ATTACCGTGC TTGGCTTGGC TTGGTCAGGA TTCACCACCA GAGTCATGTG GGAGGGGGTG 240 

GGAACCCAAA CAATTCAGGA TTCTGCCCTC AGGAAATAAA GGAGAAAATA GCTGTTGGAT 300 

AAACTACCAG CAGGCACTGC TACAGCCCAT GCTTTGTGGT TTAAGGGCCA GCTAGTTACA 360 

ATGACAGCTA GTTACTGTTT CCATGTAATT TTCTTAAAGG TATTAAATTT TTCTAAATAT 420 

TAGAGCTGTA ACTTCCACTT TCTCTTGAAG GCACAGAAAG GGAGTCACAA GACACTGTTG 480 

CA6AGAATGA TGATGGCGGG TTCAGTGAGG AATGGGAAGC CCA6AGGGAC AGTCATCTAG 540 

GGCCTCATCG CTCTACACCT GAGTCACGAG CTGCTGTCCA GGAACTTTCC AGCAGTATCC €00 

TCGCTGGTGA AGACCCA6AG 6AAAGTATGT TCANTTCTCC ATNTTTCAAA GTCATGGATT 6€0 

CCTTTAGGTA GCTACATTAT CAACCTTTTT GAGAATAAAA TGAATTGAGA GTGTTACAGT 720 

CTAATTCTAT ATCACATGTA ACTTTTATTT GGATATATCA GTAATAGTGC TTTTTYNTTT 780 

TTTTTTTTTT tTTTTTTTTT TTTTNGGNGA NAGAGTCTCG CTCTGTCGCC AGGTTGGAGT 84 0 

GCAATGGTGC GATCTTGGCT CACTGAAAGC TCCACCNCCC GGGTTCAAGT GATTCTCCTG SOO 

CCTCAGCCNC CCAAGTAGNT GGGACTACAG GGGTGCGCCA CCACGCCTGG GATAATTTTG 960 

GGNTTTTTAG TAGAGATGGC GTTTCACCAN CTTGGHGCAG GCTGGTCTTG GAACTCCTGA 1020 

NATCATGATC TGCCTGCCTT AGCCTCCCCA AAOTGCTGGG ATTMCAGGGG TGA6CCACTG 1080 

TTCCTGGGCC TC 1092 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ix) FEATURE: 

{A) NAME/KEY: misc_feature 
(B) LOCATION: 1. .1003 

(O) OTHER INFORMATION: /noce- »lExl2- 

(Xi) SEQXTENCE DESCRIPTION: SEQ ID NO: 14: 

CTGCAGTGAG CCGAGATCAT GCTGCTGTAC TCCAGCCTGG GCCACAGAGC CAAACTCCAT 60 

CTCCCAAAAA AAAAAAATAT TAATTAATAT GATNAAATGA TGCCTATCTC AGAATTCTTG 120 

TAACGATTTC TTAGKACAAG TGCTGGGTAT AAACTATANA TTCRATAGAT GNCGATTATT 180 

ACTTAYTATT GTTATTGATA AATAACAGCA GCATCTACAG TTAAGACTCC AGAGTCAGTC 240 

ACATAGAATC TGGNACTCCT ATTGTAGNAA ACCCCNMNAS AAAGAAAACA CAGCTQAAGC 300 

CTAATTTTGT ATATCATTTA CTGACTTCTC TCATTCATTG TG6GGTTGAG TAGGGCAGTG 360 

ATATTTTTGA ATT6TGAAAT CATANCAAAG AGTGACCAAC TTTTTAATAT TTGTAACCTT 420 

TCCTTTTTAG GGGGAGTAAA ACTTGGATTG GGAGATTTCA TTTTCTACAG TGTTCTGGTT 480 

GGTAAAGCCT CAGCAACAGC CAGTGGAGAC TGGAACACAA CCATAGCCTG TTTCGTAGCC 540 

ATATTAATTG TMMSTATACA CTAATAAGAA TGTGTCAGAG CTCTTAATGT CMAAACTTTG 600 
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ATTACACAGT CCCTTTAAGG CAGTTCTGTT TTAACCCCAG GTGGGTTAAA TATTCCAGCT 660 

ATCTGAGGAG CTTTTNGATA ATTGGACCTC ACCTTAGTAG TTCTCTACCC TGGCCACACA 720 

TTAGAATCAC TTGGGAGCTT TTAAAACTGT AAGCTCTGCC CTGAGATATT CTTACTCAAT 780 

TTAATTGTGT AGTtTTTAAA ATTCCCCAGG AAATTCTGGT ATTTCTGTTT AGGAACCGCT 840 

GCCTCAAGCC TAGCAGCACA GATATGTAGG AAATTAGCTC TGTAAGGTTG GTCTTACAGG 900 

GATAAACAGA TCCTTCCTTA GTCCCTGGAC TTAATCACTG AGAGTTTGGG TGGTGGTTTT 960 

GGATTTAATG ACACAACCTG TAGCATGCAG TGTTACTTAA GAC 1003 

(2) INFORMATION FOR SEQ ID NO: IS: 

(i) SEQUENCE CHARACTERISTICS: 

{A} LENGTH: 736 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(0) TOPOLOGY: liilfiar 



(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1..736 

(D) OTHER INFORMATION: /note* "1EX13" 

(Xi) SEQUENCE DESCRIPTION: SBQ ID NO: IS: 

GTCTTTCCCA TCTTCTCCAC A66GTTTGTG 

GAAAGCATTG CCAGCTCTTC CAATCTCCAT 

AGATTATCTT GTACAGCCTT TTATGGACCA 

TATTTGCGGT TAGAATCCCA TGGATGTTTC 

CAAAGGTGAT TTCCTGTGTC CACATCTAAC 

GTTCCTTCCA AGTCTTCCTG ACCACCTTGC 

GAAAACGATT TTGAACATAC TTCATCGCAG 

GATTTGAGGG ACGAGGTCAA GGAGATATGA 

AGCTTGACGC GTGGTCACAG GACGATTTTC 

TACCAAGAGG TTAGGTGAAG TG6TTTAAAC 

TTGAAAATCA ACCCAATAAT TCTGTATTAA 

GAGGAAGAGC AGGCACCACC AGCAOAATGG 

TCCCTTTGAT TTTTTG 

(2) INFORMATION FOR SEQ ID NO: 16 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1964 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 188.. 1508 



CCTTACATTA TTACTCCTTG 
CACCTTTGGG CTTGTTTTCT 
ATTAGCATTC CATCAATTTT 
TT C TTT GA CT ATAACAAAAT 
AAATCAAGAT CCCCGGCTGG 
ACTATTGGAC TTTGGAAGGA 
TG6ACTGTGT CCTCGGTGCA 
TAGGCCCGGA AGTTGCTGTG 
ACTGACACTG CGAACTCTCA 
CAAACGGAAC TCTTCATCTT 
CTGAATTCTG AACTTTTCAG 
GGAATGGAGA GGTGGGCAGG 



CCATTTTCAA 60 

ACTTT G CCAC 120 

ATATCTAGCA 180 

CTGGGGAGGA 240 

ACTTTTGGAG 300 

GGTGCCTATA 360 

GAAACTACCA 420 

CCCCATCAGC 480 

GGACTACCGT 540 

AAACTACACG 600 

GAGGTACTGT 660 

GGTTCCAGCT 720 
736 
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(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1..1964 

(D) OTHER INFORMATION: /note= "mPSl" 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16: 

ACCANACANC GGCAGCTGAG GCGGAAACCT AGGCTGCGAG CCGGCCGCCC GGGCGCGGAG €0 

AGAGAAGGAA CCAACACAAG ACTIGCAGCCC TTCGAGGTCT TTAGGCAGCT TGGAGGAGAA 120 

CACATGAGAG AAAGAATCCC AAGAGGTTTT GTTTTCTTTG AGAAGGTATT tCTGTCCAGC 180 

TGCTCCA ATG ACA GAG ATA CCT GCA CCT TTG TCC TAG TTC CAG AAT GCC 229 
Met Thr GIu He Pro Ala Pro Leu Ser Tyr Phe Gin Aan Ala 
15 10 

CAG ATG TCT GAG GAG AGO CAC TCC AGC AGC GCC ATC CGG AGC CAG AAT 277 
Gin Met Ser Glu Asp Ser Hia Ser Ser Ser Ala He Arg Ser Gin Aan 
15 20 25 30 

6AC AGC CAA GAA CGG CAG CAG CAG CAT GAC A6G CAG AGA CTT GAC AAC 325 
Aap Ser Gin Glu Arg Gin Gin Gin Hia Aap Arg Gin Arg Leu Asp Aan 
35 40 45 

CCT GAG CCA ATA TCT AAT GGG CGG CCC CAG AGT AAC TCA AGA CAG GTG 373 
Pro Glu Pro He Ser Aan Gly Arg Pro Gin Ser Aan Ser Arg Gin Val 
50 55 60 

GTG GAA CAA GAT GAG GAG GAA GAC GAA GAG CTG ACA TTG AAA TAT GGA 421 
Val Glu Gin Aap Glu GXu Glu Aap Glu Glu Leu Thr Leu Lya Tyr Gly 
6B 70 75 

GCC AAG CAT GTC ATC ATG CTC TTT GTC CCC GTG ACC CTC TGC ATG GTC 469 
Ala Lya Hia Val He Net Leu Phe Val Pro Val Thr Leu Cys Met Val 
80 85 90 

GTC GTC GTG GCC ACC ATC AAA TCA GTC AGC TTC TAT ACC CGG AAG GAC 517 
Val Val Val Ala Thr He Lya Ser Val Ser Phe Tyr Thr Arg Lya Aap 
95 100 105 110 

GGT CAG CTA ATC TAG ACC CCA TTC ACA GAA GAC ACT GAG ACT GTA GGC 565 
Gly Gin Leu He Tyr Thr Pro Phe Thr Glu Aap Thr Glu Thr Val Gly 
115 120 125 

CAA AGA GCC CTG CAC TCG ATC CTG AAT GCG GCC ATC ATG ATC AGT GTC 613 
Gin Arg Ala Leu Hia Ser He Leu Aan Ala Ala He Met He Ser Val 
130 135 140 

ATT GTC ATT -ATG ACC" ATC CTC CTG GTG GTC CTG TAT AAA TAC AGG TGC 661 
He Val He Met Thr He Leu Leu Val Val Leu Tyr Lya Tyr Arg Cya 
145 150 155 

TAC AAG GTC ATC CAC GCC TGG CTT ATT ATT TCA TCT CTG TTG TTG CTG 709 
Tyr Lya Val He Hia Ala Trp Leu He He Ser Ser Leu Leu Leu Leu 
160 165 170 

TTC TTT TTT TCG TTC ATT TAC TTA GGG GAA GTA TTT AAG ACC TAC AAT 757 
Phe Phe Phe Ser Phe He Tyr Leu Gly Glu Val Phe Lya Thr Tyr Aan 
175 180 185 190 

GTC GCC GTG GAC TAC GTT ACA GTA GCA CTC CTA ATC TGG AAT TTT GGT 805 
Val Ala Val Asp Tyr Val Thr Val Ala Leu Leu He Trp Asn Phe Gly 
195 200 205 

GTG GTC GGG ATG ATT GCC ATC CAC TGG AAA GGC CCC CTT CGA CTG CAG 853 
Val Val Gly Met He Ala He Hia Trp Lya Gly Pro Leu Arg Leu Gin 
210 215 220 

CAG GCG TAT CTC ATT ATG ATC AGT GCC CTC ATG GCC CTG GTA TTT ATC 901 
Gin Ala Tyr Leu He Met He Ser Ala Leu Met Ala Leu Val Phe He 
225 230 235 
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AAG TAC CTC CCC GAA TGG ACC GCA TGG CTC ATC TTG GCT GTG ATT TCA 949 
Lys Tyr Leu Pro Glu Trp Thr Ala Trp Leu lie Leu Ala Val He Ser 
240 245 250 

GTA TAT GAT TTG GTG GCT GTT TTA TGT CCC AAA GGC CCA CTT CGT ATG 997 
Val Tyr Asp Leu Val Ala Val Leu Cys Pro Lys Gly Pro Leu Arg Met 
255 260 265 270 

CTG GTT GAA ACA GCT CAG GAA AGA AAT GAG ACT CTC TTT CCA GCT CTT 1045 
Leu Val Glu Thr Ala Gin Glu Arg Asn Glu Thr Leu Phe Pro Ala Leu 
275 280 285 

ATC TAT TCC TCA ACA ATG GTG TGG TTG GTG AAT ATG GCT GAA GGA GAC 1093 
He Tyr Ser Ser Thr Met Val Trp Leu Val Aan Met Ala Glu Gly Asp 
290 295 300 

CCA GAA GCC CAA AGG AGG GTA CCC AAG AAC CCC AAG TAT AAC ACA CAA 1141 
Pro Glu Ala Gin Arg Arg Val Pro Lye Asn Pro Lys Tyr Asn Thr Gin 
305 310 315 

AGA GCG GAG AGA GAG ACA CAG GAC AGT GGT TCT GGG AAC GAT GAT GGT 1189 
Arg Ala Glu Arg Glu Thr Gin Asp Ser Gly Ser Gly Asn Asp Asp Glv 
320 325 330 f ^ y 

GGC TTC AGT GAG GAG TGG GAG GCC CAA AGA GAC AGT CAC CTG GGG CCT 1237 
Gly Phe Ser Glu Glu Trp Glu Ala Gin Arg Asp Ser His Leu Gly Pro 
335 340 345 350 

CAT CGC TCC ACT CCC GAG TCA AGA GCT GCT GTC CAG GAA CTT TCT GGG 1285 
His Arg Ser Thr Pro Glu Ser Arg Ala Ala Val Gin Glu he\i Ser Gly 
355 360 365 

AGC ATT CTA ACG AGT GAA GAC CCG GAG GAA AGA GGA GTA AAA CTT GGA 1333 
Ser He Leu Thr Ser Glu Asp Pro Glu Glu Arg Gly Val Lys Leu Gly 
370 375 380 

CTG GGA GAT TTC ATT TTC TAC ACT GTT CTG GTT GGT AAG GCC TCA GCA 1381 
Leu Gly Asp Phe He Phe Tyr Ser Val Leu Val Gly Lys Ala Ser Ala 
385 390 395 

ACC GCC AGT GGA GAC TGG AAC ACA ACC ATA GCC TGC TTT GTA GCC ATA 1429 
Thr Ala Ser Gly Asp Trp Asn Thr Thr He Ala Cys Phe Val Ala He 
400 405 410 

CTG ATC GGC CTG TGC CTT ACA TTA CTC CTG CTC GCC ATT TTC AAG AAA 1477 
Leu He Gly Leu Cys Leu Thr Leu Leu Leu Leu Ala lie Phe Lys Lys 
415 420 425 430 

GCG TTG CCA GCC CTC CCC ATC TCC ATC ACC TTC GGG CTC GTG TTC TAC 152.5 
Ala Leu Pro Ala Leu Pro He Ser He Thr Phe Gly Leu Val Phe Tyr 
435 - - • 440 445 

TTC GCC ACG GAT TAC CTT GTG CAG CCC TTC ATG GAC CAA CTT GCA TTC 1573 
Phe Ala Thr Asp Tyr Leu Val Gin Pro Phe Met Asp Gin Leu Ala Phe 
450 455 460 

CAT CAG TTT TAT ATC TAGCCTTTCT GCAGTTAGAA CATGGATGTT TCTTCTTTGA 1628 
His Gin Phe Tyr He 
465 



TTATCAAAAA CACAAAAACA 


GAGAGCAAGC 


CCGAGGAGGA 


GACTGGT6AC 


TTTCCTGTGT 


1688 


CCTCAGCTAA CAAAGGCAG6 


ACTCCAGCTG 


GACTTCTGCA 


GCTTCCTTCC 


GAGTCTCCCT 


1748 


AGCCACCCGC ACTACTGGAC 


TGTGGAA6GA AGCGTCTACA 


GAGGAACGGT 


TTCCAACATC 


1808 


CATCGCTGCA 6CAGACGGTG 


TCCCTCAGTG 


ACTTGAGAGA 


CAAGGACAAG 


GAAATGTGCT 


1868 


GGGCCAAGGA GCTGCCGTGC 


TCTGCTAGCT 


TTGACCGTGG 


GCATGGAGAT 


TTACCC6CAC 


1928 


TGTGAACTCT CTAAGGTAAA 


CAAAGTGAGG 


TGAACC 






1964 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 £7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Thr Glu He Pro Ala Pro Leu Ser Tyr Phe Gin Asn Ala Gin Met 
15 10 15 

Ser Glu Asp Ser His Ser Ser Ser Ala He Arg Ser Gin Asn Asp Ser 
20 25 30 

Gin Glu Arg Gin Gin Gin His Asp Arg Gin Arg Leu Asp Asn Pro Glu 
35 40 45 

Pro He Ser Asn Gly Arg Pro Gin Ser Asn Ser Arg Gin Val Val Glu 
50 55 60 

Gin Asp Glu Glu Glu Asp Glu Glu Leu Thr Leu Lys Tyr Gly Ala Lys 
65 70 75 80 

His Val He Met Leu Phe Val Pro Val Thr Leu Cys Met Val Val Val 
85 90 95 

Val Ala Thr He Lys Ser Val Ser Phe Tyr Thr Arg Lys Asp Gly Gin 
100 105 110 

Leu He Tyr Thr Pro Phe Thr Glu Asp Thr Glu Thr Val Gly Gin Arg 
115 120 125 

Ala Leu His Ser He Leu Asn Ala Ala He Met He Ser Val He Val 
130 135 140 

He Met Thr He Leu Leu Val Val Leu Tyr Lys Tyr Arg Cys Tyr Lys 
145 150 155 160 

Val He His Ala. Trp Leu He He Ser Ser Leu Leu Leu Leu Phe Phe 
165 170 175 

Phe Ser Phe He Tyr Leu Gly Glu Val Phe Lys Thr Tyr Asn Val Ala 
180 185 190 

Val Asp Tyr Val Thr Val Ala Leu Leu He Trp Asn Phe Gly Val Val 
195 200 205 

Gly Met He Ala lie His Trp Lys Gly Pro Leu Arg Leu Gin Gin Ala 
210 215 220 

Tyr Leu He Net He Ser Ala Leu Met Ala Leu Val Phe He Lys Tyr 
225 230 235 240 

Leu Pro Glu Trp Thr Ala Trp Leu He Leu Ala Val He Ser Val Tyr 
245 250 255 

Asp Leu Val Ala Val Leu Cys Pro Lys Gly Pro Leu Arg Met Leu Val 
260 265 270 

Glu Thr Ala Gin Glu Arg Asn Glu Thr Leu Phe Pro Ala Leu He Tyr 
275 280 285 

Ser Ser Thr Met Val Trp Leu Val Asn Met Ala Glu Gly Asp Pro Glu 
290 295 300 

Ala Gin Arg Arg Val Pro Lys Asn Pro Lys Tyr Asn Thr Gin Arg Ala 
305 310 315 320 

Glu Arg Glu Thr Gin Asp Ser Gly Ser Gly Asn Asp Asp Gly Gly Phe 
325 330 335 
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Ser Glu Glu Trp Glu Ala Gin Arg Aap Ser His Leu Gly Pro Hia Arg 
340 345 350 

Ser Thr Pro Glu Ser Arg Ala Ala Val Gin Glu Leu Ser Gly Ser lie 
355 360 365 

Leu Thr Ser Glu Asp Pro Glu Glu Arg Gly Val Lys Leu Gly Leu Gly 
370 375 3B0 

Asp Phe He Phe Tyr Ser Val Leu Val Gly Lys Ala Ser Ala Thr Ala 
365 390 395 400 

Ser Gly Asp Trp Aan Thr Thr lie Ala Cys Phe Val Ala He Leu He 
405 410 415 

Gly Leu Cys Leu Thr Leu Leu Leu Leu Ala He Phe Lys Lys Ala Leu 
420 425 430 

Pro Ala Leu Pro He Ser He Thr Phe Gly Leu Val Phe Tyr Phe Ala 
435 440 445 

Thr Asp Tyr Leu Val Gin Pro Phe Met Asp Gin Leu Ala Phe His Gin 
450 455 460 

Phe Tyr He 
465 

(2) INFORMATION FOR SEQ ID NO: IB: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2229 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



( ix) FEATURE : 

(A) NAMB/KEY: CDS 

(B) LOCATION: 366.. 1712 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1..2226 

(D) OTHER INFORMATION: /note= -hPS2- 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:18: 

GAATTCGGCA CGAGGGCATT TCCAGCAGTG AGGAGACAGC CAGAAGCAAG CTTTTGGAGC 60 

TGAAGGAACC TGAGACAGAA GCTA6TCCCC CCTCTGAATT TTACTGAT6A AGAAACTGAG 120 

GCCACAGAGC TAAAGTGACT TTTCCCAAGG TCGCCCAGCG AGQACGT6GG ACTTCTCAOA 180 

CGTCAGGAGA GT6ATGTGAG G6AGCTGTGT GACCATAGAA AGTGACGTGT TAAAAACCAG 240 

CGCTGCCCTC TTTGAAAGCC AGGGAGCATC ATTCATTTAG CCTGCTGAGA AGAAGAAACC 300 

AAGTGTCCG6 GATTCAAGAC CTCTCTQCOG CCCCAAGTGT TCGTGGTGCT TCCAGAGGCA 360 

GGGCT ATG CTC ACA TTC ATG GCC TCT GAC AGC GAG GAA GAA GTG TGT 407 
Met. Leu Thr Phe Met Ala Ser Asp Ser Glu Glu Glu Val Cys 
15 10 

GAT GAG CGG ACG TCC CTA ATG TCG GCC GAG AGC CCC ACG CCG CGC TCC 455 
Asp Glu Arg Thr Ser Leu Met Ser Ala Glu Ser Pro Thr Pro Arg Ser 
15 20 25 30 

T6C CA6 6AQ GGC AGG CAG GGC CCA GAG GAT GGA GAG AAT ACT GCC CAG 503 
Cys Gin Glu Gly Arg Gin Gly Pro Glu Asp Gly Glu Asn Thr Ala Gin 
35 40 45 
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TG6 AGA AGC CA6 GAG AAC GAG GAG GAC GGT GAG GAG 6AC CCT GAC CGC 551 
Trp Arg Ser Gin Glu Asn Glu Glu Asp Gly Glu Glu Aap Pro Asp Arg 

50 55 60 

TAT GTC TGT AGT GGG GTT CCC GGG C6G CCG CCA GGC CTG GAG GAA GAG 599 
Tyz Val Cys Ser Gly Val Pro Gly Arg Pro Pro Gly Leu Glu Glu Glu 
65 70 75 

CTG ACC CTC AAA TAC GGA GCG AAG CAT GTG ATC ATG CTG TTT GTG CCT 647 
Leu Thr Leu Lys Tyr Gly Ala Lys His Val lie Met Leu Phe Val Pro 
BO 85 90 

GTC ACT CTG TGC ATG ATC GTG GTG GTA GCC ACC ATC AAG TCT GTG CGC 695 
Val Thr Leu Cys Met He Val Val Val Ala Thr He Lys Ser Val Arg 
95 100 105 HO 

TTC TAC ACA GAG AAG AAT GGA CAG CTC ATC TAC ACG CCA TTC ACT GAG 743 
Phe Tyr Thr Glu Lys Asn Gly Gin Leu He Tyr Thr Pro Phe Thr Glu 
115 120 125 

GAC ACA CCC TCG GTG GGC CAG CGC CTC CTC AAC TCC GTG CTG AAC ACC 791 
Asp Thr Pro Ser Val Gly Gin Arg Leu Leu Asn Ser Val Leu Asn Thr • 
130 135 140 

CTC ATC ATG ATC AGC GTC ATC GTG GTT ATG ACC ATC TTC TTG GTG GTG 839 
Leu He Met He Ser Val He Val Val Met Thr He Phe Leu Val Val 
145 150 155 

CTC TAC AAG TAC CGC TGC TAC AAG TTC ATC CAT GGC TGG TTG ATC ATG 887 
Leu Tyr Lys Tyr Arg Cys Tyr Lys Phe He His Gly Trp Leu He Met 
160 165 170 

TCT TCA CTG ATG CTG CTG TTC CTC TTC ACC TAT ATC TAC CTT GGG GAA 935 
Ser Ser Leu Met Leu Leu Phe Leu Phe Thr Tyr He Tyr Leu Gly Glu 
175 ISO IBS 190 

GTG CTC AAG ACC TAC AAT GTG GCC ATG GAC TAC CCC ACC CTC TTG CTG 983 
Val Leu Lys Thr Tyr Asn Val Ala Met Aap Tyr Pro Thr Leu Leu Leu 
195 200 205 

ACT GTC TGG AAC TTC GGG GCA GTG GGC ATG GTG TGC ATC CAC TGG AAG 1031 
Thr Val Trp Asn Phe Gly Ala Val Gly Met Val Cys He His Trp Lys 
210 215 220 

GGC CCT CTG GTG CTG CAG CAG GCC TAC CTC ATC ATG ATC AGT GCG CTC 1079 
Gly Pro Leu Val Leu Gin Gin Ala Tyr Leu He Met He Ser Ala Leu 
225 230 235 

ATG GCC CTA GTG TTC ATC AAG TAC CTC CCA GAG TGG TCC GCG TGG GTC 1127 
Met Ala Leu Val Phe He Lys Tyr Leu Pro Glu Trp Ser Ala Trp Val 
240 245 250 

ATC CTG GGC GCC ATC TCT GTG TAT GAT CTC GTG GCT GTG CTG TGT CCC 1175 
He Leu Gly Ala He Ser Val Tyr Asp Leu Val Ala Val Leu Cys Pro 
255 260 265 270 

AAA GGG CCT CTG AGA ATG CTG GTA GAA ACT GCC CAG GAG AGA AAT GAG 1223 
Lys Gly Pro Leu Arg Met Leu Val Glu Thr Ala Gin Glu Arg Asn Glu 
275 280 285 

CCC ATA TTC CCT GCC CTG ATA TAC TCA TCT GCC ATG GTG TGG ACG GTT 1271 
Pro He Phe Pro Ala Leu He Tyr Ser Ser Ala Met Val Trp Thr Val 
290 295 300 

GGC ATG GCG AAG CTG GAC CCC TCC TCT CAG GGT GCC CTC CAG CTC CCC 1319 
Gly Met Ala Lys Leu Asp Pro Ser Ser Gin Gly Ala Leu Gin Leu Pro 
305 310 315 

TAC GAC CCG GAG ATG GAA GAA GAC TCC TAT GAC AGT TTT GGG GAG CCT 1367 
Tyr Aap Pro Glu Met Glu Olu Asp Ser Tyr Asp Ser Phe Gly Glu Pro 
320 325 330 
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TCA TAC CCC GAA GTC TTT GAG CCT CCC TTG ACT GGC TAG CCA GGG GAG 14X5 
Ser Tyr Pro Glu Val Phe Glu Pro Pro Leu Thr Gly Tyr Pro Gly Glu 
335 340 345 350 

GAG CTG GAG GAA GAG GAG GAA AGG GGC GTG AAG CTT GGC CTC GGG GAC 1463 
Glu Leu Glu Glu Glu Glu Glu Arg Gly Val Lys Leu Gly Leu Gly Asp 
355 360 365 

TTC ATC TTC TAC AGT GTG CTG GTG GGC AAG GCG GCT GCC ACG GGC AGC 1511 
Phe He Phe Tyr Ser Val Leu Val Gly Lys Ala Ala Ala Thr Gly Ser 
370 375 380 

GGG GAC TGG AAT ACC ACG CTG GCC TOC TTC GTG GCC ATC CTC ATT GGC 1559 
Gly Asp Trp Aan Thr Thr Leu Ala Cys Phe Val Ala He Leu He Gly 
385 390 395 

TTG TGT CTG ACC CTC CTG CTG CTT GCT GTG TTC AAG AAG GCG CTG CCC 1607 
Leu Cya Leu Thr Leu Leu Leu Leu Ala Val Phe Lys Lys Ala Leu Pro 
400 405 4J.0 

GCC CTC CCC ATC TCC ATC ACG TTC GGG CTC ATC TTT TAC TTC TCC ACG 1655 
Ala Leu Pro He Ser He Thr Phe Gly Leu He Phe Tyr Phe Ser Thr 

415 420 425 430 

GAC AAC CTG GTG CGG CCG TTC ATG GAC ACC CTG GCC TCC CAT CAG CTC 1703 
Asp Asn Leu Val Arg Pro Phe Met Asp Thr Leu Ala Ser His Gin Leu 
435 440 445 

TAC ATC TGA GGGACATGGT GTGCCACAGG CTGCAAGCTG CA6GGAATTT 1752 
Tyr He ♦ 



TCATTGGATG 


CAGTTGTATA 


GTTTTACACT 


CTAGTGCCAT ATATTTTTAA GACTTTTCTT 


1812 


TCCTTAAAAA 


ATAAAGTACG 


TGTTTACTTO 


GTGAGGAGGA 


GGCAGAACCA 


GCTCTTTGGT 


1872 


GCCAGCTGTT 


TCATCACCAG 


ACTTTGOCTC 


CCGCTTTGGG 


GAGCGCCTCG 


CTTCACGGAC 


193:2 


AGGAAGCACA 


GCAGGTTTAT 


CCAGATGAAC 


TGAGAAGGTC 


AGATTAGGGT 


GGGGAGAAGA 


1992 


GCATCCGGCA 


TGAGGGCTGA 


GATGCCCAAA 


GAGTGTGCTC 


GGGAGTGGCC 


CCTGGCACCT 


2052 


GGGTGCTCTG 


GCTGGAGAGG 


AAAAGCCAGT 


TCCCTACGAG 


GAGTGTTCCC 


AATGCTTTGT 


2112 


CCATGATGTC 


CTTGTTATTT 


TATTNCCYTT 


ANAAACTGAN TCCTMTTNTT NTTDGGGCAG 


2172 


TCACMCTNCT 


GGGRAGTGGC 


TTAATAGTAA 


NATCAATAAA 


NAGNTGAGTC 


CTNTTAG 


2229 



<2} INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 449 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Het Leu Thr Phe Met Ala Ser Asp Ser Glu Glu Glu Val Cys Asp Glu 

15 10 15 

Arg Thr Ser Leu Het Ser Ala Glu Ser Pro Thr Pro Arg Ser Cys Gin 
20 25 30 

Glu Gly Arg Gin Gly Pro Glu Asp Gly Glu Asn Thr Ala Gin Trp Arg 
35 40 45 

Ser Gin Glu Asn Glu Glu Asp Gly Glu Glu Asp Pro Asp Arg Tyr Val 
50 55 60 



SUBSTTTUTE SHEET (RULE 25) 



wo 96/34099 



PCT/CA96/00263 



- 151 - 



Cys Ser Gly Val Pro 61y Arg Pro Pro Gly Leu Glu Glu Glu Leu Thr 
65 70 75 80. 

Leu Lys Tyr Gly Ala Lys His Val He Met Leu Phe Val Pro Val Thr 
85 90 95 

Leu Cys Met He Val Val Val Ala Thr He Lys Ser Val Arg Phe Tyr 
100 105 110 

Thr Glu Lys Aan Gly Gin Leu He Tyr Thr Pro Phe Thr Glu Asp Thr 
115 120 125 

Pro Ser Val Gly Gin Arg Leu Leu Asn Ser Val Leu Asn Thr Leu He 
130 135 140 

Met He Ser Val He Val Val Met Thr He Phe Leu Val Val Leu Tyr 
145 150 155 160 

Lya Tyr Arg Cys Tyr Lys Phe He His Gly Trp Leu He Met Ser Ser 
165 170 175 

Leu Met Leu Leu Phe Leu Phe Thr Tyr He Tyr Leu Gly Glu Val Leu 
180 185 190 

Lys Thr Tyr Asn Val Ala Met Asp Tyr Pro Thr Leu Leu Leu Thr Val 
195 200 205 

Trp Asn Phe Gly Ala Val Gly Met Val Cys He His Trp Lys Gly Pro 
210 215 220 

Leu Val Leu Gin Gin Ala Tyr Leu He Net He Ser Ala Leu Met Ala 
225 230 235 240 

Leu Val Phe He Lys Tyr Leu Pro Glu Trp Ser Ala Trp Val He Leu 
245 250 255 

Gly Ala He Ser Val Tyr Asp Leu Val Ala Val Leu Cys Pro Lys Gly 
260 265 270 

Pro Leu Arg Met Leu Val Glu Thr Ala Gin Glu Arg Asn Glu Pro He 
275 280 285 

Phe Pro Ala Leu He Tyr Ser Ser Ala Met Val Trp Thr Val Gly Met 

290 295 300 

Ala Lys Leu Asp Pro Ser Ser Gin Gly Ala Leu Gin Leu Pro Tyr Asp 
305 310 315 320 

Pro Glu Met Glu Glu Asp Ser Tyr Asp Ser Phe Gly Glu Pro Ser Tyr 
325 330 335 

Pro Glu Val Phe Glu Pro Pro Leu Thr Gly Tyr Pro Gly Glu Glu Leu 
340 345 350 

Glu Glu Glu Glu Glu Arg Gly Val Lys Leu Gly Leu Gly Asp Phe He 
355 360 365 

Phe Tyr Ser Val Leu Val Gly Lys Ala Ala Ala Thr Gly Ser Gly Asp 
370 375 380 

Trp Asn Thr Thr Leu Ala Cys Phe Val Ala He Leu He Gly Leu Cys 
385 390 395 400 

Leu Thr Leu Leu Leu Leu Ala Val Phe Lys Lys Ala Leu Pro Ala Leu 
405 410 415 

Pro He Ser He Thr Phe Gly Leu He Phe Tyr Phe Ser Thr Asp Asn 
420 425 430 



Leu Val Arg Pro Phe Met Asp Thr Leu Ala Ser His Gin Leu Tyr He 
435 440 445 
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(2) INFORMATION FOR SEQ ID NO: 20: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1B9S baae paira 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single. 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION.: 140.. 1762 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1..X895 

(D) OTHER INFORMATION: /note* "DmPS" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:20: 
TATATGAOTC GCTTTAAAAC AAAA6AAAGT TTTTACCAGC TACATTCCTT TGCTTTCCTT €0 

AACTAAATCC CATCACACAA CTACGGCTTC GCAGGGGGAG GCGTCCAGCG CTACGGAGGC 120 

GAACGAACGC ACACCACTG ATG GCT GCT GTC AAT CTC CAG GCT TCG TGC TCC 172 
Met Ala Ala Val Asn Leu Gin Ala Ser Cys Ser 
15 10 

TCC GGG CTC GCC TCT GAG GAT GAC GCC AAT GTG GGC AGC CAG ATA GGC 220 
Ser Gly Leu Ala Ser Glu Asp Asp Ala Asn Val Gly Ser Gin lie Gly 
15 20 25 

GCG GCG GAG CGT TTG GAA CGA CCT CCA AGG CGG CAA CAG CAG CGG AAC 26S 
Ala Ala Glu Arg Leu Glu Arg Pro Pro Arg Arg Gin Gin Gin Arg Asn 
30 35 40 

AAC TAC GGC TCC AGC AAT CAG GAT CAA CCG GAT GCT GCC ATA CTT GCT 316 
Asn Tyr Gly Ser Ser Asn Gin Asp Gin Pro Asp Ala Ala lie Leu Ala 
45 50 55 

GTG CCC AAT GTG GTG ATG CGT GAA CCT TGT GGC TCG CGC CCT TCA AGA 364 
Val Pro Asn Val Val Met Arg Glu Pro Cys Gly Ser Arg Pro Ser Arg 
60 65 70 75 

CTG ACC GGT GGA GGA GGC GGC AGT GGT GGT CCG CCC ACA AAT GAA ATG 412 
Leu Thr Gly Gly Gly Gly Gly Ser Gly Gly Pro Pro Thr Asn Glu Met 
80 85 90 

GAG GAA GAG CAG GGC CTG AAA TAC GGG GCC CAG CAT GTG ATC AAG TTA 460 
Glu Glu Glu' Gin Gly Leu Lys Tyr Gly Ala Gin His Val He Lya Leu 
95 100 105 

TTC GTC CCC GTC TCC CTT TGC ATG CTG GTA GTG GTG GCT ACC ATC AAC 508 
Phe Val Pro Val Ser Leu Cys Net Leu Val Val Val Ala Thr He Asn 
110 115 120 

TCC ATC AGC TTC TAC AAC AGC ACG GAT GTC TAT CTC CTC TAC ACA CCT SS6 
Ser He Ser Phe Tyr Asn Ser Thr Asp Val Tyr Leu Leu Tyr Thr Pro 
125 130 135 

TTC CAT GAA CAA TCG CCC GAG CCT AGT GTT AAG TTC TGG AGT GCC TTG 604 
Phe His Glu Gin Ser Pro Glu Pro Ser Val Lys Phe Trp Ser Ala Leu 
140 145 150 155 

GCG AAC TCC CTG ATC CTG ATG AGC GTG GTG GTG GTG ATG ACC TTT TTG 652 
Ala Asn Ser Leu He Leu Met Ser Val Val Val Val Met Thr Phe Leu 
160 165 170 

CTG ATT GTT TTG TAC AAG AAG CGT TGC TAT CGC ATC ATT CAC GGC TGG 700 
Leu He Val Leu Tyr Lys Lys Arg Cys Tyr Arg He He His Gly Trp 
175 180 185 
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CTG ATT CTC TCC TCC TTC ATG TTG TTG TTC ATT TTT ACQ TAG TTA TAT 748 
Leu He Leu Ser Ser Phe Met Leu Leu Phe He Phe Thr Tyr Leu Tyr 
190 195 200 

TTG GAA GAG CTT CTT CGC GCC TAT AAC ATA CCG ATG GAG TAG OCT ACT 796 
Leu Glu Glu Leu Leu Arg Ala Tyr Aan He Pro Met Asp Tyr Pro Thr 
205 210 215 

GCA CTA CTG ATT ATG TGG AAC TTT GGA GTG GTC GGA ATG ATG TCC ATC 844 
Ala Leu Leu He Met Trp Asn Phe Gly Val Val Gly Met Met Ser He 
220 225 230 235 

CAT TGG CAG GGA CCT CTG CGG TTG CAG CAA GGA TAT CTC ATT TTC GTG 892 
His Trp Gin Gly Pro Leu Arg Leu Gin Gin Gly Tyr Leu He Phe Val 
240 245 2S0 

GCA GCC TTG ATG GCC TTG GTG TTC ATT AAA TAG CTG CCT GAA TGG ACT 940 
Ala Ala Leu Met Ala Leu Val Phe He Lys Tyr Leu Pro Glu Trp Thr 
255 250 2fi5 

GCC TGG GCT OTA TTG OCT GCC ATT TCT ATT TOO GAT CTT ATT OCT CTC 988 
Ala Trp Ala Val Leu Ala Ala He Ser He Trp Asp Leu He Ala Val 
270 275 280 

CTT TCG CCA AGA GGA CCC CTC CGC ATT CTG GTG GAA ACG GCT CAG GAG 1036 
Leu Ser Pro Arg Gly Pro Leu Arg He Leu Val Glu Thr Ala Gin Glu 
285 290 295 

CGA AAT GAG CAA ATC TTC CCC GCT CTG ATT TAT TCA TCC ACT GTC GTT 1084 
Arg Asn Glu Gin He Phe Pro Ala Leu He Tyr Ser Ser Thr Val Val 
300 305 310 315 

TAC GCA CTT GTA AAC ACT GTT ACG CCG CAG CAA TCG CAG GCC ACA GCT 1132 
Tyr Ala Leu Val Asn Thr Val Thr Pro Gin Gin Ser Gin Ala Thr Ala 
320 325 330 

TCC TCC TCG CCO TCG TCC AGC AAC TCC ACC ACA ACC AGO AGO GCC ACG 1180 
Ser Ser Ser Pro Ser Ser Ser Asn Ser Thr Thr Thr Thr Arg Ala Thr 
335 340 345 

CAG AAC TCG CTG GCT TCG CCA GAG GCA GCA GCG GCT AGT GGC CAA CGC 1228 
Gin Asn Ser Leu Ala Ser Pro Glu Ala Ala Ala Ala Ser Gly Gin Arg 
350 355 360 

ACA GGT AAC TCC CAT CCT CGA CAG AAT CAG CGG GAT GAG GGC AGT GTA 1276 
Thr Gly Asn Ser Kis Pro Arg Gin Asn Gin Arg Asp Asp Gly Ser Val 
365 370 375 

CTG GCA ACT GAA GGT ATG CCA CTT GTG ACT TTT AAA AGC AAT TTG CGC 1324 
Leu Ala Thr Glu Gly Met Pro Leu Val Thr Phe Lys Ser Asn Leu Arg 
380 385 390 395 

GGA AAC GCT GAG GCT GCG GGT TTC ACG CAA GAG TGG TCA GCT AAC TTG 1372 
Gly Asn Ala Glu Ala Ala Gly Phe Thr Gin Glu Trp Ser Ala Asn Leu 
400 405 410 

AGC GAA CGT GTG GCT CGT CGC CAG ATT GAA GTT CAA AGT ACT CAG AGT 1420 
Ser Glu Arg Val Ala Arg Arg Gin He Glu Val Gin Ser Thr Gin Ser 
415 420 425 

GGA AAC GCT CAG CGC TCC AAC GAG TAT AGO ACA GTA ACA GCT CCG GAT 1468 
Gly Asn Ala Gin Arg Ser Asn Glu Tyr Arg Thr Val Thr Ala Pro Asp 
430 435 440 

CAG AAT CAT CCG GAT GGG CAA GAA GAA CGT GGC ATA AAG CTT GGC CTC 15X6 
Gin Asn His Pro Asp Gly Gin GlU Glu Arg Gly He Lys Leu Gly Leu 
445 450 455 

GGC GAC TTC ATC TTC TAC TCG GTA TTA GTG GGC AAG GCC TCC AGC TAC 1564 
Gly Asp Phe He Phe Tyr Ser Val Leu Val Gly Lys Ala Ser Ser Tyr 
460 465 470 475 



SUBSTITUTE SHEET (RULE 26) 



wo 9d/34099 



PCT/CA96/00263 



- 154 - 



GGC GAC TGG ACQ ACC ACA ATC OCT TGC TTT GTG GCC ATC CTC ATT GGA 1612 
Gly Asp Trp Thr Thr Thr He Ala Cys Pbe Val Ala He Leu He Gly 
480 485 490 

CTC TGC CTC ACT CTT CTG CTT CTG GCC ATT TGG CGC AAG GCG CTA CCC 1660 
Leu Cys Leu Thr Leu Leu Leu Leu Ala He Trp Arg Lys Ala Leu Pro 
495 500 SOS 

GCC CTG CCC ATC TCA ATA ACG TTC GGA TTG ATA TTT TGC TTC GCC ACT 1708 
Ala Leu Pro He Ser He Thr Phe Gly Leu He Phe Cys Phe Ala Thr 
510 515 520 

AGT GCG GTG GTC AAG CCG TTC ATG GAG GAT CTA TCG GCC AAG CAG GTG 1756 
Ser Ala Val Val Lys Pro Phe Met Glu Asp Leu Ser Ala Lys Gin Val 
525 530 535 

TTT ATA TAAACTTGAA AAGACAAGGA CACATCAAGT GTCTTACAGT ATCATAGTCT 1812 

Phe He 

540 

AACAAAGCTT TTTGTAATCC AATTCTTTAT TTAACCAAAT GCATAGTAAC AACCTCGACT 1872 
AAAAAAAAAA AAAAAAAAAA AAA 1895 

(2) INFORMATION FOR SEQ ID NO: 21: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 amino acids 

(B) TYPE: amino acid 
(O) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Net Ala Ala Val Asn Leu Gin Ala Ser Cys Ser Ser Gly Leu Ala Ser 
1 5 10 IS 

Glu Asp Asp Ala Asn Val Gly Ser Gin He Gly Ala Ala Glu Arg Leu 
20 25 30 

Glu Arg Pro Pro Arg Arg Gin Gin Gin Arg Asn Asn Tyr Gly Ser Ser 
35 40 45 

Asn Gin Asp Gin Pro Asp Ala Ala He Leu Ala Val Pro Asn val Val 
50 55 60 

Met Arg Glu Pro Cys Gly Ser Arg Pro Ser Arg Leu Thr Gly Gly Gly 
65 70 75 80 

Gly Gly Ser Gly Gly Pro Pro Thr Asn Glu Met Glu Glu Glu Gin Gly 
85 90 95 

Leu Lys Tyr Gly Ala Gin His Val He Lys Leu Phe Val Pro Val Ser 
100 105 110 

Leu Cys Met Leu Val Val Val Ala Thr He Asn Ser He Ser Phe Tyr 
115 120 125 

Asn Ser Thr Asp Val Tyr Leu Leu Tyr Thr Pro Phe His Glu Gin Ser 
130 135 140 

Pro Glu Pro Ser Val Lys Phe Trp Ser Ala Leu Ala Asn Ser Leu He 
145 150 155 160 

Leu Met Ser Val Val Val Val Met Thr Phe Leu Leu He Val Leu Tyr 
165 170 175 

Lys Lys Arg Cys Tyr Arg He He His Gly Trp Leu He Leu Ser Ser 
180 185 190 
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Phe Met Leu Leu Phe lie Phe Thr Tyr Leu Tyr Leu 61u 61u Leu Leu 
195 200 205 

Arg Ala Tyr Asn He Pro Net Asp Tyr Pro Thr Ala Leu Leu He Met 
210 215 220 

Trp Asn Phe Gly Val Val Gly Met Met Ser He His Trp Gin Gly Pro 
225 230 235 240 

Leu Arg Leu Gin Gin Gly Tyr Leu He Phe Val Ala Ala Leu Met Ala 
245 250 255 

Leu Val Phe He Lys Tyr Leu Pro Glu Trp Thr Ala Trp Ala Val Leu 
260 2C5 270 

Ala Ala He Ser He Trp Asp Leu He Ala Val Leu Ser Pro Arg Gly 
275 280 285 

Pro Leu Arg He Leu Val Glu Thr Ala Gin Glu Arg Asn Glu Gin He 
290 295 300 

Phe Pro Ala Leu He Tyr Ser Ser Thr Val Val Tyr Ala Leu Val Asn 

305 310 315 320 

Thr Val Thr Pro Gin Gin Ser Gin Ala Thr Ala Ser Ser Ser Pro Ser 
325 330 335 

Ser Ser Asn Ser Thr Thr Thr Thr Arg Ala Thr Gin Asn Ser Leu Ala 
340 345 350 

Ser Pro Glu Ala Ala Ala Ala Ser Gly Gin Arg Thr Gly Asn Ser His 

355 360 365 

Pro Arg Gin Asn Gin Arg Asp Asp Gly Ser Val Leu Ala Thr Glu Gly 
370 375 380 

Met Pro Leu Val Thr Phe Lys Ser Asn Leu Arg Gly Asn Ala Glu Ala 
385 390 395 400 

Ala Gly Phe Thr Gin Glu Trp Ser Ala Asn Leu Ser Glu Arg Val Ala 
405 410 415 

Arg Arg Gin He Glu Val Gin Ser Thr Gin Ser Gly Asn Ala Gin Arg 
420 425 430 

Ser Asn Glu Tyr Arg Thr Val Thr Ala Pro Asp Gin Asn His Pro Asp 
435 440 445 

Gly Gin Glu Glu Arg Gly He Lys Leu Gly Leu Gly Asp Phe He Phe 
450 455 460 

Tyr Ser Val Leu Val Gly Lys Ala Ser Ser Tyr Gly Asp Trp Thr Thr 
465 470 475 480 

Thr He Ala Cys Phe Val Ala He Leu He Gly Leu Cys Leu Thr Leu 
485 490 495 

Leu Leu Leu Ala He Trp Arg Lys Ala Leu Pro Ala Leu Pro He Ser 
500 505 510 

He Thr Phe Gly Leu He Phe Cys Phe Ala Thr Ser Ala Val Val Lys 

515 520 525 

Pro Phe Met Glu Asp Leu Ser Ala Lys Gin Val Phe He 
530 535 540 

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(xi) SEQX7ENCE DESCRIPTION: SEQ ID NO: 22: 
CTNCCNGART GGACNGYCTG G 
(2) INFORMATION FOR SEQ ID NO: 23: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
RCANGCDATN GTNGTRTTCC A 
(2) INFORMATION FOR SEQ ID NO:24: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
TTTTTTCTCG AGACNGCNCA RGARAGAAAY GA 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
TTTTTTGGAT CCTARAADAT RAARTCNCC 29 
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CLAIMS 

What is claimed is: 

1. An isolated nucleic acid comprising a nucleotide sequence 
encoding a protein selected £rom the group consisting of a normal 
presenilin-l protein, a mutant presenilin-1 protein, a normal 
presenilin-2 protein, and a mutant presenilin-2 protein. 

2. An isolated nucleic acid as in claim 1 wherein said nucleic 
acid encodes a normal presenilin-1 protein and wherein said 
nucleotide sec[uence is selected from the group consisting of 

(1) a sequence encoding a protein comprising the human 
presenilin-1 amino acid sequence of SEQ ID NO: 2; 

(2) a sequence encoding a protein comprising the human 
presenilin-1 amino acid sequence of SEQ ID NO: 4; 

(3) a sequence encoding a protein comprising the murine 
presenilin-1 amino acid sequence of SEQ ID NO: 17 

(4) a sequence encoding a protein comprising the amino acid 
of sequence of SEQ ID NO: 2 wherein residue 257 is replaced by 
alanine and residues 258*290 are omitted; 

(5) a sequence encoding a protein comprising the amino acid 
of sequence of SEQ ID NO: 4 wherein residue 253 is replaced by 
alanine and residues 254-286 are omitted; and 

(6) a sequence encoding a normal presenilin-1 protein and 
capable of hybridizing to a sequence complementary to any 
sequence of (1) - (5) under stringent hybridization conditions* 

3. An isolated nucleic acid as in claim 1 wherein said nucleic 
acid encodes a mutant presenilin-l protein, 

wherein said nucleotide sequence encodes at least one 
mutation which corresponds to a mutation of SEQ ID NO: 2 selected 
from the group consisting of A79?, V82L,V96F, Y115H, .M139T, 
M139V, I143T, M146L, M146V, H163R, H163Y, L171P, G209V, I211T, 
A231T, A246E, A260V, C263R, P2S4L, P267S, E280A, E280G, A285V, 
L286V, ^291-319 1 G384A, L392V and C410Y; and 

wherein said nucleotide sequence otherwise corresponds to a 
nucleotide sequence selected from the group consisting of 

(1) a sequence encoding a protein comprising the human 
presenilin-1 amino acid sequence of SEQ ID NO: 2; 

(2) a sequence encoding a protein comprising the human 
presenilin-1 amino acid sequence of SEQ ID NO: 4; 
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(3) a sequence encoding a protein comprising the murine 
presenilin-1 amino acid sequence of SEQ ID NO: 17; 

(4) a sequence encoding a protein comprising the amino acid 
of sequence of SEQ ID NO: 2 wherein residue 257 is replaced by 
alanine and residues 258-290 are omitted; 

(5) a sequence encoding a protein comprising the amino acid 
of sequence of SEQ ID NO: 4 wherein residue 253 is replaced by 
alanine and residues 254-286 are omitted; and 

(6) a sequence encoding a normal presenilin-l protein auid 
capable of hybridizing to a sequence complementary to any 
sequence of (i) - (5) under stringent hybridization conditions. 

4. An isolated nucleic acid as in claim 1 wherein said nucleic 
acid encodes a mutant presenilin-l protein, 

wherein said nucleotide sequence encodes at least one 
mutation which corresponds to a mutation of SEQ ID NO: 19 
selected from the group consisting of M239V« N141I and I420T; and 

wherein said nucleotide sequence otherwise corresponds to a 
nucleotide sequence selected from the group consisting of 

(1) a sequence encoding a protein comprising the human 
presenilin-1 amino acid secjuence of SEQ ID NO: 2; 

(2) a sequence encoding a protein comprising the human 
presenilin-l amino acid sequence of SEQ ID NO: 4; 

(3) a sequence encoding a protein comprising the murine 
presenilin-l amino acid sequence of SEQ ID NO: 17; 

(4) a sequence encoding a protein comprising the amino acid 
of sequence of SEQ ID NO: 2 wherein residue 257 is replaced by 
alanine and residues 258-290 are omitted; 

(5) a sequence encoding a protein comprising the amino acid 
of sequence of SEQ ID NO: 4 wherein residue 253 is replaced by 
alanine and residues 254-286 are omitted; and 

(6) a sequence encoding a normal presenilin-1 protein and 
capable of hybridizing to a sequence complementary to any 
secjuence of (!) - (5) under stringent hybridization conditions. 

5. An isolated nucleic acid as in claim 1 wherein said nucleic 
acid encodes a normal presenilin-2 protein and wherein said 
nucleotide sequence is selected from the group consisting of 

(1) a sequence encoding a protein comprising the human 
presenilin-2 amino acid sequence of SEQ ID NO: 19; 
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(2) a sequence encoding a protein comprising the human 
presenilin-2 amino acid sequence of SEQ.ID NO: 19 wherein 
residues 263-296 are omitted; and 

(3) a sequence encoding a normal presenilin-2 protein and 
capable of hybridizing to a sequence complementary to any one of 
sequences (1) - (2) under stringent hybridization conditions. 

6 . An isolated nucleic acid as in claim 1 wherein said nucleic 
acid encodes a mutant presenilin-2 protein, 

wherein said nucleotide sequence encodes at least one 
mutation which corresponds to a mutation of SEQ ID NO: 19 
selected from the group consisting of M239V, N141I and I420T; and 

wherein said nucleotide sequence otherwise corresponds to a 
nucleotide sequence selected from the group consisting of 

(1) a sequence encoding a protein comprising the human 
presenilin-2 amino acid sequence of SEQ ID NO: 19; 

(2) a sequence encoding a protein con^rising the hiiman 
presenilin-2 amino acid of sequence of SEQ ID NO: 19 wherein 
residues 263-296 are omitted; and 

(3) a sequence encoding a normal presenilin-2 protein and 
capable of hybridizing to a seG[uence complementary to any 
sequence of (1) - (2) under stringent hybridization conditions. 

7 . An isolated nucleic acid as in claim 1 wherein said nucleic 
acid encodes a mutant pre8enilin-2 protein, 

wherein said nucleotide sequence encodes at least one 
mutation which corresponds to a mutation of SEQ ID NO: 2 selected 
from the group consisting of A79?, V82L,V96F, Y115H, M139T, 
M139V, I143T, M146L, M146V, H163R, H163Y, L171P, G209V, I211T, 
A231T, A246E, A260V, C263R, P264L, P267S, E280A, E280G, A285V, 
L286V, A291-319, G384A, L392V and C410Y; and 

wherein said nucleotide sequence otherwise corresponds to a 
nucleotide sequence selected from the group consisting of 

(1) a sequence encoding a protein comprising the human 
presenilin-2 amino acid sequence of SEQ ID NO: 19; 

(2) a sequence encoding a protein comprising the human 
presenilin-2 amino acid of sequence of SEQ ID NO: 19 wherein 
residues 263-296 are omitted; and 

(3) a sequence encoding a normal presenilin-2 protein and 
capable of hybridizing to a sequence complementary to euiy 
sequence of (1) - (2) under stringent hybridization conditions. 
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8. An isolated nucleic acid comprising a nucleotide sequence of 
at least 10 consecutive nucleotides selected from the group 
consisting SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 16, SEQ ID NO: 
18, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ 
ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 
13, SEQ ID NO: 14, SEQ ID NO: 15, and a sequence complementary to 
any of these sequences. 

9. An isolated nucleic acid comprising a nucleotide sequence of 
at least 15 consecutive nucleotides selected from the group 
consisting SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 16, SEQ ID NO: 
18, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ 
ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 
13, SEQ ID NO: 14, SEQ ID NO; 15, and a sequence complementary to 
any of these sequences. 

^10. An isolated nucleic acid comprising a nucleotide sequence of 
at least 20 consecutive nucleotides selected from the group 
consisting SEQ ID NO: 1, SBQ ID NO: 3, SEQ ID NO: 16, SEQ ID NO: 
18, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ 
ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 
13, SEQ ID NO: 14, SEQ ID NO: 15, and a sequence complementary to 
any of these sequences. 

11. An isolated nucleic acid comprising a nucleotide sequence 
comprising at least 10 consecutive nucleotides from a presenilin 
insert in a plasmid selected from the group consisting of ATCC 
Accession # 97214, ATCC Accession # 97508, ATCC Accession # 97124 
and ATCC Accession # 97428. 

12. An isolated nucleic acid comprising a nucleotide sequence 
encoding at least one functional domain of a presenilin protein 
selected from the group consisting of a normal presenilin-1 
protein, a mutant presenilin-l protein, a normal presenilin«-2 
protein, and a mutsuit presenilin* 2 protein. 

13 . An isolated nucleic acid as in claim. .12 wherein said 
fiinctional domain is a presenilin-l functional domain 
corresponding to a domain selected from the group consisting of a 
presenilin-l N- terminal. TMl, TMl-^2, TM2, TM2->3, TM3. TM3-^, TM4, 
TM4-»5, TM5, TM5-»6, TM6, TM6-»7, TM7, and C- terminal domain. 

14. An isolated nucleic acid as in claim 12 wherein said 
functional domain is a presenilin-2 functional domain 
corresponding to a domain selected from the group consisting of a 
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presenilin-2 N- terminal, TMl, TMl-#2, TM2, TM2-*3, TM3, TM3-#4, TM4, 
TM4-*5, TM5, TM5->6 , TM6 , TM6->7, TM7, and C- terminal domain. 

15. An isolated nucleic acid comprising a nucleotide sequence 
encoding an antigenic determinant o£ a presenilin protein 
selected from the group consisting of a normal presenilin- 1 
protein, a mutant presenilin-1 protein, a normal presenilin-2 
protein, and a mutcint presenilin- 2 protein. 

16. An isolated nucleic acid as in claim 15, wherein said 
sequence encodes a presenilin- 1 antigenic determinant 
corresponding to a presenilin-1 antigenic determinauit selected 
from the group consisting of amino acid residues 27-44, 28-61, 
46-48, 50-60, 65-71, 66-67, 107-111, 109-112, 120-121, 120-122, 
125-126, 155-160, 185-189, 214-223, 218-221, 220-230, 240-245, 
241-243, 267-269, 273-282, 300-370, 302-310, 311-325, 332-342, 
346-359, 372-382, 400-410 and 400-420 of SEQ ID NO: 2. 

17. An isolated nucleic acid as in claim 15, wherein said 
sequence encodes a presenilin-2 antigenic determinant 
corresponding to a presenilin-2 antigenic determinauit selected 
from the group consisting of amino acid residues 25-45, 50-63, 
70-75, 114-120, 127-132, 162-167, 221-226, 282-290, 310-314, 321- 
338, 345-352, 380-390 and 430-435 of SEQ ID NO: 19. 

18. A method for identifying allelic variants or heterospecif ic 
homologues of a human presenilin gene comprising 

choosing a nucleic acid probe or primer capedDle of 
hybridizing to a hixman presenilin gene sequence xmder stringent 
hybridization conditions; 

mixing said probe or primer with a sample of nucleic acids 
which may contain a nucleic acid corresponding to said variant or 
homologue; 

detecting hybridization of said probe or primer to said 
nucleic acid corresponding to said variant or homologue. 

19. A method as in claim 18 wherein said sample comprises a 
sample of nucleic acids selected from the group consisting of 
human genomic DNA, human mRNA, and human cDNA. 

20. A method as in claim 18 wherein said sample comprises a 
sample of nucleic acids selected from the group consisting of 
mammalian genomic DNA, mammalieux mRNA, and mammalian cDNA. 

21. A method as in claim 18 wherein said sample comprises a 
sample of nucleic acids selected from the group consisting of 
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invertebrate genomic DNA, invertebrate mRNA, and invertebrate 
cDNA. 

22. A method as in claim 18 further comprising the step of 
isolating said nucleic acid corresponding to said variant or 
homologue. 

23. A method as in claim 18 wherein said nucleic acid is 
identified by hybridization. 

24. A method as in claim 18 wherein said nucleic acid is 
identified by PGR amplification. 

25. A method for identifying allelic variants or heterospecific 
homologues of a humaui presenilin gene comprising 

choosing an antibody capable of selectively binding to a 
hximan presenilin protein; 

mixing said antibody with a sample of proteins which may 
contain a protein corresponding to said variant or homologue; 

detecting binding of said antibody to said protein 
corresponding to said variant or homologue. 

26 . A method as in claim 25 wherein said san^le comprises a 
sample of proteins selected from the group consisting of human 
proteins, human fusion proteins, and proteolytic fragments 
thereof . . 

27. A method as in claim 25 wherein said sample comprises a 
sample of nucleic acids selected from the group consisting of 
mammalian proteins, mammalian fusion proteins, and proteolytic 
fragments thereof.. 

26. A method as in claim 25 wherein said sample comprises a 
sample of nucleic acids selected from the group consisting of 
invertebrate proteins, invertebrate fusion proteins, and 
proteolytic fragments thereof.. 

29. A method as in claim 25 further comprising the step of 
substantially purifying said protein corresponding to variauit or 
homologue . 

30. An isolated nucleic acid comprising an allelic variant or a 
heterospecific homologue of a human presenilin gene. 

31. An isolated nucleic acid encoding an allelic variant or 
heterospecific homologue of a humeui presenilin protein. 

32. An isolated nucleic acid as in claim 31 wherein said nucleic 
acid encodes a Drosophila melanoaaster homologue of a human 
presenilin gene. 
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33. An isolated nucleic acid as in claim 32 wherein said nucleic 
acid comprises a nucleotide sequence selected from the group 
consisting of 

(1) a sequence encoding a protein comprising the DmPS amino 
acid sequence of SEQ ID NO: 21; 

(2) a. sequence encoding a presenilin homologue protein and 
capable of hybridizing to a sequence complementary to the 
sequence of (1) under stringent hybridization conditions. 

34. An isolated nucleic acid comprising a nucleotide sequence of 
at least 10 consecutive nucleotides selected from the group 
consisting of SEQ ID NO: 21 and a seq[uence complementary to SEQ 
ID NO: 21. . 

35. An isolated nucleic acid comprising a recombinamt vector 
including a nucleotide sequence of any one of claims 1-34. 

36. An isolated nucleic acid as in claim 35 wherein said vector 
is an expression vector and said presenilin nucleotide sequence 
is operably joined to a regulatory region. 

37. An isolated nucleic acid as in claim 36 wherein said 
expression vector may express said presenilin sequence in 
mammalian cells. 

38. An isolated nucleic acid as in claim 37 wherein said cells 
are selected from the group consisting of fibroblast, liver, 
kidney, spleen, bone marrow and neurological cells. 

39. An isolated nucleic acid as in claim 37 wherein said vector 
is selected from the group consisting of vaccinia virus, 
adenovirus, retrovirus, neurotropic viruses and Herpes simplex. 

40. An isolated nucleic acid as in claim 36 wherein said 
expression vector encodes at least a fiinctional domain of a 
presenilin protein selected from the group consisting of normal 
presenilin-1, mutant presenilin- 1., normal presenilin-2, and 
mutant presenilin-2. 

41. An isolated nucleic acid as in claim 36 wherein said vector 
further comprises sequences encoding an exogenous protein 
operably joined to said presenilin sequence and whereby said 
vector encodes a presenilin fusion protein. 

42. An isolated nucleic acid as in claim 41 wherein said 
exogenous protein is selected from the group consisting of lacZ, 
trpE, maltose-binding protein, poly-His tags or glutathione- S - 
transferase. 
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43 . An isolated nucleic acid conprising a recombinant expression 
vector including nucleotide sequences corresponding to an 
endogenous regulatory region of a presenilin gene. 

44. An isolated nucleic acid as in claim 43 wherein said 
endogenous regulatory region is operably joined to a marker gene. 

45. A host cell transformed with an expression vector o£ any one 
o£ claims 36->44, or a descendant thereof. 

46. A host cell as in claim 45 wherein said host cell is 
selected from the group consisting of bacterial cells and yeast 
cells . 

47. A host cell as in claim 45 wherein said host cell is 
selected from the group consisting of fetal cells, embryonic stem 
cells, zygotes I gametes, and germ line cells. 

48. A host cell as in claim 45 wherein said cell is selected 
from the group consisting of fibroblast, liver, kidney, spleen, 
bone marrow and neurological cells. 

49. A host cell as in claim 45 wherein said cell is an 
invertebrate cell. 

50. A non-human animal model for Alzheimer's Disease, wherein a 
genome of said aoiimal, or an ancestor thereof, has been modified 
by at least one recombinant construct, and wherein said 
recombinamt construct has introduced a modification selected from 
the group consisting of (1) insertion of nucleotide sequences 
encoding at least a functional domain of a heterospecif ic normal 
presenilin gene, (2) insertion of nucleotide sequences encoding 
at least a fxmctional domain of a heterospecif ic mutant 
presenilin gene, (3) insertion of nucleotide sequences encoding 
at least a functional domain of a conspecific homologue of a 
heterospecif ic mutant presenilin gene, and (4) inactivation of an 
endogenous presenilin gene. 

51. An animal as in claim 50 wherein said modification is 
insertion of a nucleotide sequence encoding at least a functional 
domain of a normal h\iraan presenilin- 1 gene. 

52. An animal as in claim 50 wherein said modification is 
insertion of a nucleotide sequence encoding at least a functional 
domain of a mutant human presenilin-1 gene. 

53. An animal as in claim 50 wherein said modification is 
insertion of a nucleotide sequence encoding at least a functional 
domain of a normal human presenilin- 2 gene. 
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54. An animal as in claim 50 wherein said modification is 
insertion of a nucleotide sequence encoding at least' a ftinctional 
domain of a mutouit humam presenilin-2 gene. 

55. An animal as in claim 50 wherein said modification is 
insertion of a nucleotide sequence encoding at least a functional 
domain of a normal or mutant human presenilin protein. 

56. An animal as in claim 50 wherein said animal is selected 
from the group consisting of rats, mice, hamsters, guinea pigs, 
rabbits, dogs, cats, goats, sheep, pigs, and non*human primates. 

57. An animal as in claim 50 wherein said auiimal is an 
invertebrate . 

58. , A method for producing at least a functional domain of a 
presenilin protein comprising culturing a host cell of any of 
claims 45-49 under suitable conditions to produce said presenilin 
by expressing said nucleic acid. 

59. A substantially pure preparation of a protein selected from 
the group consisting of a normal presenilin-l protein, a mutant 
presenilin- r protein, a normal presenilin-2 protein, and a mutant 
presenilin-2 protein. 

60. A substantially pure preparation as in claim 59 wherein said 
protein comprises a normal presenilin-l protein selected from the 
group consisting of 

(1) a protein comprising the amino acid sequence of SEQ ID 

NO: 2; 

(2) a protein comprising the amino acid sequence of SEQ ID 

NO: 4; 

(3) a protein comprising the amino acid sequence of SEQ ID 
NO: 17; 

(4) a protein comprising the amino acid of sequence of SEQ 
ID NO: 2 wherein residue 257 is replaced by alanine and residues 
258-290 are omitted; and 

(5) a protein con^rising the amino acid of sequence of SEQ 
ID NO: 4 wherein residue 253 is replaced by alanine and residues 
254-286 are omitted. 

61. A substantially pure preparation as in claim 59 wherein said 
protein comprises a mutant presenilin-l protein including at 
least one mutation which corresponds to a mutation of SEQ ID NO: 

2 selected from the group consisting of A79?, V82L,V96F, yil5H, 
M139T, M139V, I143T, M146L.^ M146V, H163R, H163Y, L171P, G209V, 
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I211T, A231T, A246E, A260V, C263R, P264L, P267S, E280A, E280G, 
A2B5V, L286V, A291-319, G384A, L392V and C410Y; and 

wherein said protein otherwise corresponds to an amino acid 
sequence selected from the group consisting of 

(1) an amino acid sequence of SEQ ID NO: 2; 

(2) an amino acid sequence of SEQ ID NO: 4; 

(3) an amino acid sequence of SEQ. ID NO: 17; 

(4) an amino acid of sequence of SEQ ID NO: 2 wherein 
residue 257 is replaced by alanine and residues 258-290 are 
omitted; and 

(4) an amino acid of sequence of SEQ ID NO: 4 wherein 
residue 253 is replaced by alanine and residues 254-286 are 
omitted. 

62. A substantially pure preparation as in claim 59 wherein said 
protein comprises a normal presenilin-2 protein selected from the 
grottp consisting of 

(1) a protein comprising the amino acid sequence of SEQ ID 
NO: 19; and 

(2) a protein comprising the amino acid of sequence of SEQ 
ID NO: 19 wherein residues 263-296 are omitted. 

63. A substantially pure preparation as in claim 59 wherein said 
protein con^rises a mutant presenilin-2 protein including at 
least one mutation which corresponds to a mutation of SEQ ID NO: 
19 selected from the group consisting of M239V, N141I and 14207; 
and 

wherein said protein otherwise corresponds to an amino acid 
sequence selected from the group consisting of 

(1) an. amino acid sequence of SEQ ID NO: 19; and 

(2) an amino acid of sequence of SEQ ID NO: 19 wherein 
residues 263-296 are omitted. 

64. A substantially pure preparation of a polypeptide comprising 
an amino acid sequence of at least 5 consecutive amino acid 
residues selected from the group consisting SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 17, SEQ ID NO: 19, and SEQ ID NO: 21. 

65. A substantially pure preparation of a polypeptide comprising 
an amino acid sequence of at least 10 consecutive amino acid 
residues selected from the group consisting SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 17, SEQ ID NO: 19, and SEQ ID NO: 21. 

66. A substantially pure preparation of a polypeptide comprising 
an amino acid sequence of at least 15 consecutive amino acid 
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residues selected from the group consisting SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 17, SEQ ID NO: 19, and SEQ ID NO: 21. 

67. A substantially pure preparation of a polypeptide comprising 
at least one functional domain of a presenilin protein selected 
from the group consisting of a normal presenilin-l protein, a 
mutant presenilin- 1 protein, a normal presenilin-2 protein, and a 
mutant presenilin- 2 protein. 

68. A substfiuitially pure preparation as in claim 67 wherein said 
fimctional domain is a presenilin- 1 functional domain 
corresponding to a domain selected from the group consisting of a 
presenilin- 1 N- terminal, TMl, TMl-»2, TM2, TM2-»3, TM3, TN3-»4, TM4, 

, TM4-»5, TM5, TM5-»6, TM6 , TN6-»7, TM7, and C- terminal domain. 

69. A substcuitially pure preparation as in claim 67 wherein said 
functional domain is a presenilin-2 functional domain 
corresponding to a domain selected from the group consisting of a 
presenilin-2 N- terminal, TMl, TMl-»2, TM2, TM2-*3, TM3, TM3-^, TM4, 
TM4-»5, TM5, TM5->6, TM6 , TM6-»7, TM7, and C- terminal domain. 

70. A sxabstantially pure preparation of a polypeptide comprising 
an antigenic determinant of a presenilin protein selected from 
the group consisting of a normal presenilin- 1 protein, a mutant 
presenilin-1 protein, a normal presenilin-2 protein, and a mutant 
presenilin-2 protein. 

71. A substantially pure preparation as in claim 70, wherein 
said polypeptide comprises a presenilin-1 antigenic determinant 
corresponding to a presenilin- 1 antigenic determinant selected 
from the group of nucleotide consisting of amino acid residues 
27-44, 28-61, 46-48, 50-60, 65-71, 66-67, 107-111, 109-112, 120- 
121, 120-122, 125-126, 155-160, 185-189, 214-223, 218-221, 220- 
230, 240-245, 241-243, 267-269, 273-282, 300-370, 302-310, 311- 
325, 332-342, 346-359, 372-382, 400-410 and 400-420 of SEQ ID NO: 
2. . 

72. A substantially pure preparation as in claim 70, wherein 
said polypeptide comprises , a presenilin-1 antigenic determin2Lnt 
corresponding to a presenilin- l antigenic determinant selected 
from the group of nucleotide consisting of amino acid residues 
25-45, 50-63, 70-75, 114-120, 127-132, 162-167, 221-226, 282-290, 
310-314, 321-338, 345-352, 380-390 and 430-435 of SEQ ID NO: 19. 

73. A method of producing euitibodies which selectively bind to a 
presenilin comprising the steps of 
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administering an invnunogenically effective amount of a 
presenilin invnunogen to an animal; 

allowing said animal to produce antibodies to said 
immunogen ; and 

obtaining said antibodies from said animal or from a cell 
culture derived therefrom. 

74 . A substantially pure preparation of an antibody which 
selectively binds to an antigenic determinant of a presenilin 
protein selected from the group consisting of a normal 
presenilin- 1, a mutant presenilin- 1, a normal presenilin-2, and a 
mutant pre8enilin-2 . 

75. A substantially pure preparation of an antibody as in claim 
74 wherein said antibody selectively binds to an antigenic 
determinant of a mutant presenilin- 1 and fails to bind to a 
normal presenilin- 1 protein. 

76. A s\ibst£uitially pure preparation of an antibody as in claim 
74 wherein said cuitibody selectively binds to an antigenic 
determinaint of a mutant presenilin- 2 and fails to bind to a 
normal presenilin- 2 protein. 

77. A cell line producing an antibody of any one of claims 74- 
76. 

78. A method for identifying compounds which can modulate the 
expression of a presenilin gene comprising 

contacting a cell with a test candidate wherein said cell 
includes a regulatory region of a presenilin gene operably joined 
to a coding region; and 

detecting a chemge in expression of said coding region. 

79. A method as in claim 78 wherein said change comprises a 
change in a level of an mRNA trsmscript encoded by said coding 
region, 

80. A method as in claim 78 wherein said change comprises a 
change in a level of a protein encoded by said coding region. 

81. A method as in claim 78 wherein said change is a result of 
an activity of a protein encoded by said coding region. 

62. A method as in claim 78 wherein said coding region encodes a 
marker protein selected from the group consisting of j9- 
galactosidase, alkaline phosphatase, green fluorescent protein, 
and lucif erase. 

83 . A method for identifying compounds which can selectively 
bind to a presenilin protein comprising the steps of 
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providing a preparation including at least one presenilin 
component ; 

contacting said preparation with a sample including at least 
one candidate compound; and 

detecting binding of said presenilin component to said 
candidate compound. 

84 . The method in 83 wherein said binding to said presenilin 
component is detected by an assay selected from the group 
consisting of: affinity chromatography, co-immunoprecipitation, a 
Biomolecular Interaction Assay, and a yeast two-hybrid system. 

85. A method of identifying compounds which Ccui modulate 
activity of a presenilin comprising the steps of 

providing a cell expressing a normal or mutant presenilin 

gene ; 

contacting said cell with at least one candidate compound; 

and 

detecting a change in a marker of said activity. 

86. A method as in claim 85 wherein measurement of said marker 
indicates a difference between cells bearing an expressed mutant 
presenilin gene and otherwise identical cells free of an 
expressed mutant presenilin gene. 

87- A method as in claim 85 wherein said change comprises a 
change in a non-specific marker of cell physiology selected from 
the group consisting of pH, intracellular calcium, cyclic AMP 
levels, GTP/GDP ratios, phosphatidylinositol activity, and 
protein phosphorylation. 

88. A method as in claim 85 wherein said change comprises a 
change in expression of said presenilin. 

89. A method as in claim 85 wherein said change comprises a 
change in intracellular concentration or flux of an ion selected 
from the group consisting of Ca^*, Na* and K*. 

90. A method as in claim 35 wherein said change comprises a 
change in occurrence or rate of apoptosis or cell death. 

91. A method as in claim 85 wherein said change comprises a 
change in production of A/3 peptides* 

92. A method as in claim 85 wherein said change comprises a 
chsmge in phosphorylation of at least one microtubule associated 
protein. 

93. A method as in claim 85 wherein said cell is a cell cultured 
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94. A method as in claim 93 wherein said cell is a transfomed 
host cell of cmy one of claims 45-49. 

95. A method as in claim 93 wherein said cell is explanted from 
a host bearing at least one mutant presenilin gene. 

96. A method as in claim 93 wherein said cell is explanted from 
a transgenic animal of any one of claims 50-57. 

97. A method as in claim 85 wherein said cell is a cell in a 
live animal. 

98. A method as in claim 97 wherein said cell is a cell of a 
transgenic animal of any one of claims 50-57. 

99. A method as in claim 85 wherein said cell is in a hijman 
subject in a clinical trial. 

100. A diagnostic method for determining if a subject bears a 
mutant presenilin gene comprising the steps of 

providing a biological sample of said subjects- 
detecting in said sample a mutant presenilin nucleic acid, a 
mutant presenilin protein, or a mutant presenilin activity. 

101. A method as in claim 100, wherein a mutant presenilin 
nucleic acid is detected by an assay selected from the group 
consisting of direct nucleotide sequencing, probe specific 
hybridization, restriction enzyme digest and mapping, PGR 
mapping, ligase-mediated PGR detection, RNase protection, 
electrophoretic mobility shift detection, and chemical mismatch 
cleavage . 

102. A method as in claim 100, wherein a mutant presenilin 
protein is detected by an assay selected from the group 
consisting of an immunoassay, a protease assay, and an 
electrophoretic mobility assay. 

103 . A pharmaceutical preparation comprising a substantially pure 
presenilin protein and a pharmaceutically acceptable carrier. 

104. A pharmaceutical preparation comprising an expression vector 
operably encoding a presenilin protein, wherein said expression 
vector may express said presenilin protein in a human subject, 
and a pharmaceutically acceptable carrier. 

105. A phairmaceutical preparation comprising an expression vector 
operably encoding a presenilin antisense sequence, wherein said 
expression vector may express said presenilin antisense sequence 
in a human sxibject, and a pharmaceutically acceptable carrier. 
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106 . A pharmaceutical preparation comprising a substantially pure 
antibody, wherein said antibody selectively binds to a mutant 
presenilin protein, and a pharmaceutically acceptable carrier. 

107. A pharmaceutical preparation as in claim 106 wherein said 
preparation is essentially free of an antibody which selectively 
binds a normal presenilin protein. 

108. A pharmaceutical preparation comprising a substantially pure 
preparation of an antigenic determinant of a mutant presenilin 
protein. 

109. A pharmaceutical preparation as in claim 108 wherein said 
preparation is essentially free of an antigenic determinant of a 
normal presenilin protein. 

110 . A method of treatment for a patient bearing a mutant 
presenilin gene comprising the step of administering to said 
patient a therapeutically effective amount of the pharmaceutical 
preparation of any one of claims 103-109. 

111. A method as in claim 110, wherein said pharmaceutical 
preparation is targeted to a cell type is selected from the group 
consisting of heart, brain, lung, liver, skeletal muscle, kidney, 
pancreas and neurological cells. 



SUBSmUTE SHEET (RULE 25) 



wo 96/34099 



PCT/CA96/00263 




wo 96/34099 



PCT/CA96/00263 



2/6 




FIG. 2 



SUBSTITUTE SHEET (RUU 26) 



wo 9604099 



3/6 



PCT/CA96/00263 



hPS 1 MTELPAPLS YFQNAQMS EDNHLSNTVRSQNDNRERQEHNDRRSLGHPE 4 8 

III IIIIIIIIIIMIII I I lllli Ml II I II 

mPSl MTEIPAPLSYFQNAQMSEDSHSSSAIRSQMDSQERQQQKORQRLDNPE 48 

hPSl PLSNGRPQGNSRQWEQDEEEDEELTLKYGAKHVIMLFVPVTLCMWV 96 

I mill lllillllllillllllllllllllllllllllllllll 

mPSl PTSKTGRPOSWSRQVVEQnEEEDEELTLKYQAK HVTMLFVPVTLgMVW 96 



hPSl VATIKSVSFYTRKDGQLIYTPFTEDTETVGQRALHSILNAAIMISVIV 144 

MMIIiMIIIIIIIIMIIIIIIIIIIIIMIMIIIIIIIIIIII 

mPSl VATTKSVSPYTRKDGQLIYTPFTEDTETVGQRALHS ILNAAIMISVIV ~ 144 



hPSl . VMTILLWLYKYRCYKVIHAWLIISSLLLLFFFSFIYLGEVFKTYNVA 192 

llllllllllllllllllllllllllllllllllllliillilllll 
mPSl IMTILLVVLYl CYRgYICVI HAWLIISS TJJJ.PPyfiPTYT/3KVF^^ 192 



hPSl VDYITVALLIWNFGWGMISIHWKGPLRLQQAYLIMISALMALVFIKY 240 

III lllllllllllllll llllllllllllllllllllllllllll 
mPSl VDYVTVALLIWOTGVVailAIHWKGPLRMQAYLIMISALHftliVFIKY • 240 



hPSl LPEWTAWLIlAVISVYDLVAVIXrPKGPIJlMLVETAQERNETLFPAIJCY 288 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 

mPSl LPE WTAWLILAVISVYDLVAVL CPKGPLRMLVETAQERMETLFPALIY 288 

hPSX SST^fVWLVNMAEGDPEAQRRVSKNSKYNAESTERESQDTVAENDDGGF 336 

iiiiiiiiiiiiiiiiiiiii II III III II mill 

mPSl SSTM\mLVNMA£GDPEAQRRVPKNPKYNTQRAERETQDSGSGNDDGGF 336 

hPSl SEEWEAQRDSHLGPHRSTPESRAAVQELSSSILAGEDPEERGVKLGLG 384 

iiiiiimiiiiiiiiiiiiiimm III miimiim 

mPS 1 SEEWEAQRDSHLGPHRSTPESRAAVQELSGS ILTSEDPEERGVKLGLG 384 



hPSl DFIFYSVLVGKASATA^GDWNTTIACFVAILIGLCLTLLLLAIFKKAL 432 

iiiiiiiiiiiiiiiiiiiiiiiiimiiiiiiiiiiimiiim 

mPSl DF I FYS VLVGKAS ATA5GDWNTT I ACFVAI L F KICAL 432 

hPSl PALPISITFGLVFYFATDYLVQPFMDQLAFHQFYI 467 

llllllllllllllllllllimillllllllll 

mPSl PALPISITFGLVFYFATDYLVQPFMDQLAFHQFYI 467 



FIG. 3 



wo 96/34099 



PCT/CA96/00263 



V6 



hPSl MT-ELPAPLSYFQNA-QMSEDNHLSNTV 26 

I I I I I I III II 

hPS2 MtjTFMASDSEEEVCDERTSLMSAESPTPRSC-QEGRQGPEDGE--NTA 45 

hPSl - -RSQ-N- -DNRERQEHNDRR- -SLGHPEPLSNGRPQGNSRQWEQDE 67 

III I I I I II III INI I 

hPS2 OWRSQENEEDG-E- -EDPDRWCS-GVP GRPPGL E 76 



hPSl EEDEELTLKYGAJOCVIMLFVPVTLOlWVVATIKSVSFYTRKDGQLiy 115 

llllllllllllillllllllll IIMMIII III I Mil! 

hPS2 - - ■EELTLKYGAK HVIMLFVPVTI.gMIVWATT K<nreFVTBtOJ^^^ 121 



hPS 1 TPFTEDTETVGQRALHS ILNAAIMI SVI WMTILLWLYKYRCYKVIH 163 

11 II III MM II II IIIIIIIIMi lllllllllll II 

hPS2 TPPTEDTPSVGQRLtJlSVr.MTt>TMTflVTVVMTTP7.WT.YTr/PrWPTU 159 



hPSl AWLIISSLLLLFFFSFIYLQEVFKTYNVAVDYITVALLI-WNFGWGM 210 

Ml III III I IMIII llllll II I II MM III 

hPS2 GWLIMSSIJ1LLFr.PTYTYT.flEVLfCTYmyAM DYPTL-LLT^ 216 



hPSl ISIHWKGPLRLQQAYLIMISALMALVFIKYLPEWTAWLIL-AVISVYD 257 

MIIMI Mllllllllllllllllllllll II II I Mill 

hPS2 VCIHWKGP LVLQQAYLIMI5ALMAI.VPT KYLPE WSAWVTl^A-T<;\7Vn 263 



hPSl LVAVLCPKGPLRMLVETAQERNETLFPALIYSSTMVWLVNMAEGDPEA 305 

MIMMIMIIMIilMMII llllllll III I II II 

hPS2 LVayLCPKGPIJlMLVETAQERNEPIFPALiySSAMVWWGMAK^ 309 

hPSl QRRVSKNSKYNAESTERESQDTVAENDDGGFSE-EWEAQRDSHLG- PH 351 

II I I I 11 II I I 

hPS2 S - - SQGAL QLPY DP EME-EDSYDSF- GBP- 3 34 

hPSl RSTPESRAAVQE- - LSSSIIAGEDP- - -EERGVKLGLGDPIFYSVLVG 3 94 

I II III II IIMMMMMIIMIMI 

hPS2 - SYPE VFEPPLTGYP- -GEELEEEEERGVKLGLGDFIFYSVLVG 375 



hPSl KASATASGDWNTTIACFVAILIGLCLTLLLLAIFKKALPALPISITFG 442 

II II ilMIII IMIIIIIIIIIilMII lllllllllllllll 

hPS2 KAAATGSGDWim'IAgFVATI.TQLCT.TT.T.T.TAVPinf&T.PaT.PTgTTPr! 423 

hPSl LVFYFATDYLVQPFMDQLAFHQFYI 467 

I Ml II M MM II II M 

hPS2 LIFYFSTDNLVRPFMDTLASHQLYI 448 



FIG. 4 



wo 96/34099 



PCT/CA96/00263 




wo 96/34099 



PCT/CA96/00263 




WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCX 

INTERNATIONAL APPLICATION PUBUSHED UNDER THE PATENT COOPERATION TREATY (POT) 



(51) International Patent Classification ^ : 

C12N 15/12, C07K 14/47, C12Q 1/68, 
AOIK 67/027, C12N 5/10, A61K 38/17 



A3 



(11) International Publication Number: WO 96/34099 

(43) International Publication Date: 31 October 1996 (31.10.96) 



(21) International Application Number: PCT/CA96/00263 

(22) International FUing Date: 29 April 1996 (29.04.96) 



(30) Priority Data: 
08/431.048 
08/496,841 
08/509,359 



28 April 1995 (28.04.95) US 
28 June 1995 (28.06.95) US 
31 July 1995 (31.07.95) US 



(71) Applicants {for all desigtuiied States except US): HSC RE- 

SEARCH AND DEVELOPMENT LIMITED PARTNER- 
SHIP [CA/CA]; 555 University Avenue, Toronto. Ontario 
M5G 1X8 (CA). THE GOVERNING COUNCIL OF THE 
UNIVERSITY OF TORONTO [CA/CA]; 106 Simcoe Hall. 
27 King's College Circle. Toronto, Ontario M5S lAl (CA). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): ST. GEORGE-HYSLOP. 
Peter, H. [CA/CA];'210 Richview Avenue. Toronto, Ontario 
M5P 3G3 (CA). FRASER. Paul. E, [CA/CA]; 611 Win- 
dermere Avenue. Toronto, Ontario M6S 3L9 (CA). ROM- 
MENS, Johanna, M. [CA/CAJ; 105 McCaul Street, Toronto, 
Ontario M5T 2X4 (CA). 

(74) Agent: RAE, PaQicia, A.; Sim & McBumey, 6th floor, 330 
University Avenue, Toronto, Ontario MSG 1R7 (CA). 



(81) Designated States: AL, AM, AT, AU, AZ, BB. BG, BR, BY. 
CA, CH. CN, CZ. DE, DK. EE, ES. FI. GB, GE. HU. IS. 
JP. KE, KG. KP. KR, KZ. LK. LR, LS. LT, LU, LV, MD. 
MG, MK, MN, MW, MX, NO. N2, PL, PT, RO, RU, SD, 
SE, SG. SI. SK, TJ. TM, TR. TT. UA, UG. US, UZ. VN, 
ARIPO patent (KE, LS. MW. SD, SZ, UG), Eurasian patent 
(AM. AZ. BY. KG, KZ, MD. RU, TJ, TM), European patent 
(AT. BE, CH. DE. DK. ES, FI, FR. GB. GR, IE, IT, LU, 
MC, NL. PT, SE). OAPI patent (BF, BJ, CF, CG, CI. CM. 
GA, GN. ML. MR, NE, SN. TD, TG). 



Published 

Wiih international search report 

Before the expiradon of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 

(88) Date of publication of the Intematioiial search report: 

S December 1996 (05.12.96) 



(54) Title: GENETIC SEQUENCES AND PROTEINS RELATED TO ALZHEIMER'S DISEASE, AND USES THEREFOR 



(57) Abstract 



The present invention describes the identification, isolation, sequencing and characterization of two human presenilin genes. PS-1 and 
PS-2, mutations which lead to Familial Alzheimer's Disease. Also identified are presenilin gene homologues in mice, C. elegans and D. 
melanogaster. Nucleic acids and proteins comprising orderived from the presenilins are useful in screening and diagnosing Alzheimer's 
Disease, in identifying and developing therapeutics for treatment of Alzheimer's Disease, and in producing cell lines and transgenic animals 
useful as models of Alzheimer's Disease. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AM 


Annenia 


GB 


United Kingdom 


MW 


Malawi 


AT 


Atutria 


GE 


Geoigia 


MX 


Mexico 


AU 


Ausoalia 


GN 


Guinea 


NE 


Niger 


BB 


Barbados 


GR 


Greece 


NL 


Netherlands 


BE 


Belgium 


HU 


Himgary 


NO 


Norway 


BF 


BozfcinaFaso 


IB 


beland 


NZ 


New Zealand 


BG 


Balgaria 


IT 


Italy 


PL 


Poland 


BJ 


Benin 


JP 


Japan 


PT 


Ponugai 


BR 


Brazil 


K£ 


Kenya 


RO 


Romania 


BY 


Belarus 


KG 


Kyrgystan 


RU 


Russian Federation 


CA 


Canada 


KP 


Democratic People's Republic 


SD 


Sudan 


CF 


Central African Republic 




of Korea 


SE 


Sweden 


CG 


Congo 


KR 


Republic of Korea 


SG 


Singapore 


CH 


Switzerland 


KZ 


Kazakhstan 


SI 


Slovenia 


a 


COie d'Yvoire 


U 


Liechtenstein 


SK 


Slovakia 


CM 


CanienxHi 


LK 


Sri Lanka 


SN 


Senegal 


CN 


China 


LR 


Liberia 


5Z 


Swaziland 


cs 


Czechoslovakia 


LT 


Lithuania 


TD 


Chad 


cz 


Czech Republic 


LU 


Luxembourg 


TG 


Togo 


DE 


Germany 


LV 


Latvia 


TJ 


Tajikistan 


DK 


Denmark 


MC 


Monaco 


TT 


Trinidad and Tobago 


EE 


Estonia 


MD 


Republic of Moldova 


UA 


Ukraine 


ES 


Spain 


MG 


Madagascar 


UG 


Uganda 


n 


Finland 


ML 


Malt 


US 


United States of America 


FR 


France 


MN 


Mongolia 


uz 


Uzbekistan 


OA 


Gabon 


MR 


Mauritania 


VN 


Viet Nam 



INTERNATIONAL SEARCH REPORT 



Inter 'naal Application No 

PCl/CA 96/00263 



A. CLASSIFICATION OF SUBJECT MATTER. ^_ 

IPC 6 C12N15/12 C07K14/47 C12Q1/68 
A61K38/17 



AO1K67/027 C12N5/ie 



According to Ihternaiional Patent Qasafication (IPC) or to both national clasnfication and IPC 



B. FIELDS SEARCHED 



M inimiim documentation searched (dassiiication system followed by classification symbols} 

IPC 6 C07K C12N 



Documentation searched other than minimum documentation to the extont that such documents are included in the fields searched 



Bectnniic data base consulted during the intematiottal search (name of dau base axid» where practical, search terms used) 



C DOCUMENTS CONSIDERED TO BE RELEVANT 



Category' 



Gtalion of document^ wth indication, where approprialE, of the rdevant passages 



Relevant to claim No. 



NAT. GENET. (1992). 2(4), 330-4 CCDEN: 
NGENEC;ISSN: 1061-4036. XPeO2015738 
ST. GEORGE-HYSLOP, P. ET AL: "Genetic 
evidence for a novel familial Alzheimer 's 
disease locus on chromosome 14" 
cited in the application 
see the whole document 

NAT. GENET. (1992), 2(4), 335-9 CODEN: 

NGENEC;ISSN: 1061-4036, XP002015739 
VAN BROECKHOVEN, CHRISTINE ET AL: 
"Mapping of a gene predisposing to early - 
onset Alzheimer's disease to chromosome 
14q24.3" 

see the whole document 



-/- 



Further documents are listed in the continiiation of box C 



□ 



Patent family members are listed in annex. 



* Special categories of died documents : 

'A' document defining the general state of dK art which is not 

considered to be of paracular relevance 
'E* earlier document but published on or alterlhetnteinational 

filing dale 

'L' document which may throw doubts on jmority daimfs) or 
which is cited to establish the publication date of another 
dtation or other spedal reason (as spodfied) 

'O' document referring to an oral disdosuit, lae, exhibition or 
other means 

'P* document publidied prior to die international filing date but 
later than the priority date daimed 



T* later document published alter the international filing date 
or priority date and not in conflict with the applicanon but 
dted to understand the prindple or theory underiying the 
tnveiihoo 

'X' document of particular reievanoe; the daimed invention 
cannot be considered novd or carmot be considered to 
involve an inventive step when the document is taken alone 

'Y' doctiment of particular rdevanoe; the daimed invention 
carmot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
mentii such combination being obvious to a person skilled 
in the ait 

'ft' document member of the same patent family 



Date of the actual completion of the international search 



11 October 1996 



Dale of mailing of the intemaliottal search repcttt 



3 a lags 



Name and mailing address of the ISA 

European Patent Office, P.B. 581 8 PatenUaan 2 
NL - 2280 HV Rijswijk 
Td. (t 31-70) 340.2040. Tx. 31 6SI cpo nl. 
Fax: (i- 31-70) 340-3016 



Audiorized offico- 



Gurdjian, D 



FbiniPCT/ISA/3I0(i 



shctt) {July 1942) 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 



Inter '^onal AppUcation No 

PCl/CA 96/60263 



C^Contimiation) DOCUMENTS CONSIDERED TO BE RELEVANT 


Category* 


Citation of document, with indication, where appro|»iate, of che relevant passages 


Relevant to claim No. 


A 
n 


ADV RES NEURODEGENER. (1994K 
2(ETI0PATH0GENESIS), 3-18 CODEN: 
ARNEFX;ISSN: 1068-719X, XP0O2G15746 
GEORGE-HYSLOP, P.H. ST.: "Genetic 
evidence for a novel familial Alzheimer 's 
disease locus on chromosome 14: analysis 
of candidate genes" 
see the whole document 


1 


P.X 


NATURE (LONDON), 375 (6534). 1995. 
754-760., 

June 1995, XP802015740 
SHERRINGTON R ET AL: "Cloning of gene 
bearing missense mutations in early-onset 
familial Alzheimer's disease" 
see the whole document 


1 


P.X 


NATURE GENETICS, 1995, 11/2 (219-222). 
October 1995, USA, XP002015741 
DE LA MONTE, SUZANNE M. WANDS, JACK R.: 
"The structure of the presenilin 1 (S182) 
gene and identification of six novel 
mutations in early onset AD families" 
see the whole document 


1 


P.X 


HUMAN MOLECULAR GENETICS, 4 (IZJ. l!>9o. 
2363-2371., 
- December 1995 XP002015742 
CRUTS M ET AL: "Molecular genetic 
analysis of familial early-onset 
Alzheimer's disease linked to chromosome 
14q24.3'' 

see the whole docunent 


1 
1 


P.X 


PROC. NATL. ACAD. SCI. U. S. A. (1995). 

Qofoa.\ i9ifio.A rnnra* dma^aa-t^^n* 

^^^^0/, l^J.oU'*<f UUUtn. rNnjnD,IOOn« 

0027-8424, 

December 1995, XP002015743 
LI, JINHE ET AL: "Identification and 
expression analysis of a potential 
familial Alzheimer disease gene on 
chromosome 1 related to AD3" 
see the whole document 


1,70 


P.X 


LANCET, 

vol. 347, January 1996, 

pages 130- 131 » XPOO2015744 

BOTEVA K. ET AL.: "mutation analysis of 

presenilin 1 gene in Alzheiirer's disease" 

see the whole document 


1,70 



I PCT/ISA/310 (canttnmtioa of s 



1 ilieet) (July 19S3) 



page 2 of 2 



